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3. 



♦1 



20 



&0 



TGAGCCTAAACCTttCCCrCACCCATTTCAACCAGTCTAG 

T actaoctcaccaccatccccaacatctacaaT Cux X IGAU I I I I attt aatoctogccct actgctcc 

CAACGTTATCCAItntUACTAOTClJUgAATATTTGCAACATC IC T CCTTTT C1 11 ACTTATCGCC G fOC T CG I GC 

CAACCTIiatCAOTCACCCAICrasSCAACCCCACAAGTA^ 

ACATCCCAATCCCCCTCTCCTTI ILL I l ACTCArGCCCGT CC T GCT C C 

ccaaccttcactgttacccctcaxcaaccacccca{xagca reoccn tec x ttaatgatccccctgctgctcc 



40 



80 



100 



120 



140 



TCACCTCCAACTCAACCTCCTStCTGGOCTCTCAlTflCCOT 

TCMgtAOUGTCATTCALLIllLiLUXlLTSAlll^ rCCGAAGAAT 

TCACCTlCAAATCCA aiUlULiUXLIC TQ UtlLU.reA & 

TCACCTOCAAGTCAAGCTCCrcrCttSCaCTCrGATCTCCCTG 

CTG U.111UA^1C TGATCTGCC TOC CCCCA C J^^ 
TCACCTACAAAJTXA it. lul ILiUiA^lur GA lClUXtC ACACCCACACa^CGCTAATACGA aAXJ,! I L ArACTCCTGCCACAAATCCSAACAAT 



TTUCCTGaUtfItAAOtIUUUtllili^ll.lAAlLKH.rC**^CCa>CACCeiCAATI 



VTGCTCA rGCCACAAATCAGGAGAAT 



ISO 



180 



200 



220 



240 



C tC I I - 1 1 1 VL ICS 1 \>,f ftt^ A ^ rA ™ ArTr ^^ rrTrrr CAGGAGGaGTTT GGCAACCAC 1 rOCAAAAGGCTGAAACCATCCT 1 6 fCCT 
LlLiLLlIlLlXtaLl l j <i«« Ar A fi fl rft rrJMTTTTr^ATT > '*T^ C AGGAfl nA tTTTTGATGATAAACAGTTOCACAAGGCTCAAGCCA 1 1 fL I GTCCT 
CitlU^IiaiLLllXClC AACQU r A GACA raUTTCCGAATCCCC CACCAGCAGTTTGATGGCAACCAGTTCCAGAAGGCTCAACCCA I C T C r C TCCT 
CrCTCCTTanPCTCTXTtGA T CCACACACA TGAI " 1"- '™ C AC GACC ACTrTCATCCC A AC n A CTO CACAA G GC rC C A C CC AfC t t T C X C CT 
C rCTCCTTTTTCTt 4CCT C4 ft ^^* r *^*T UbCrTTCATTTTCCATCATCA U< ILljlLA TOCCAACCACTTCCACAAC CTTCA ACCrATCTTCC^^ 
CTCTCCTTTCTCCTCC 1 . T 17 ft ftff*T **** * T **"^^' rr> ^* TT ** rf ' r CAACACCA C1 ! t CA TGCCAACCA G TT CC ACAACCCTCAAC CC A ill iu I 

CATGACTrTCOATrrCCT CACC ACT ACTTTGATCGCAACCAL 1 1 UJU^GGCXAACCCA 1LILI01 ccr 

CTcTCcn tt t> x i m T ~rftr??^ m ^**™ *™™ AA ' mrrr caggacgaatttcatggcaaccacttccacaaagctcaaccca nrrgrcTCCT 

260 280 , 300 . 320 U0 

CCATGACATCATCCAGCAGATCTTCAA fCICTTCAOCACAAAGC ACTCA It rOCTCCI r COXi ATCACACCCTCCTACACAAATTCTACACTCAACTCTAC 
C^IGACATCATCCAgCAGACCTTCAACCTCTC^ I1X ILLU I LLA TGAGACtXTTCrAGATGAATTCTACATCCAA CTTC AC 

CCATCACATCATCCAGCACACCTTCAATCTCTTCA^ 

CCATGAGCTCAICCACCACArSTTCAACCTCTTIACCACAAAAGA ITCA FC TU1 I CC 1 1 CG GATCACCACCTCCTAGACAAATTUTGCACCGAACTCTAC 

CCATCAGATCATCCAGCACACCTTCAA^XTCTTCAGCACAAAW 

CCATGAGAXOOTXACCACAiXTTCAATCTCTTC^^ 

ccatgacaicatccaccaga csttc aa nrrcTTCACc^cAXA^ actca nrrccr actttogatgacacacttctac acaaattctaca^tcaactt^ac 

CCATGAGAXCATGCACCAGACCTTCAATCTTrrTCAGCACAAACA I G GCATCACACCCTCCTACAAAAATTCrACATTCAACTTTTC 



400 
* 



420 



440 



tTCAACGACGACTCC A rTCTCGCTCTCACGAAATACT 



360 380 

• » • » 

CACCACCTCAATGACCTOCAAGC.. 10 rCTGATAa . _ _ . 

CACCACCTCAATCACCTCCAA l rSCTO I G I C A^CAGGAAGTCCGGGTSATACA G T C T ULCU T GA TGTACGAGGACTCCA rC CTGCC K I C ACGAAATACT 
CACCAACTCAATCACCTGGAACCAICTGTCA T Ar Afi Q tfWTTCCCGTCCAACAG AC i a C C t GA TGAATCAGCACTCC A fCCTCCCTGTGAGGAAATACT 
CAGCACCTCAATCACTTGGAA CCCI L 1C rC AIGCACCAGCAC AGGCTCGGAGAAAi: I'XO: rGArGAATGTGGACTCCA rCTTCGC! L 1 GAAGAAATaCT 
CACCACCTCAATCACCTCCAAiaCCnrrCTCA^ 

CACI><XTQUTCACATCCAA UX4U^ r S ArA<UCCACGTTCCO I LL L IGTS AAGAAATACT 

CA jG C AG CffiiATBA^XT CGAACCg rGTATGATC 

CACCAAATCAATCACCTGCAA CCCTC 1U I CAT A C ACCA U.1 rSCGC rCCAAGACAC ii X CC T G ATCAATGACGACTCCA FCC t WCIUT'JAAGAAATACT 



440 



680 



500 



320 



340 



TCCAAACAArCACirTCTATCTCAAACACAAgAAATAC 

7CCAAACAATCACTSTATATCT C A C > fi AA A A fl A A A T AC AW. WTTC fGCC ICOGAGGTTGTCACACCACAAATCATGAGA i'X XTCTCTTI ATCA ATCAA 
TaLAAACJUTt^rnrrTTATCTAAT^^A^ ltv,i 1 1 rC AACAAA 

TT TAfl *r.>rt Tr i Nr rnTTftT^^^^ Ag ^^ AT ^-'-'- < 1 PCI rM. r M ^tu=CTCTC* r A rt rxrAAATtllTGA£^ ^ 

TfT 1 1 ACA I T CACTCTTTATCT C A C A C A O * A G * 4AT ACA CCCC I IU IU.UUC CACCTTCICACACCACAAATCATCAGATC L 1 fCTCI I U TCAAAAAT 
TTCAAACAATCACCCTCTATCTCA r ArAr<A^ilU>*TACA OU^.l IC r GC AIGCGACGTTCTCACAGCACAAATCATCAGA 1LC * 1 LI L1 1 1 ATC AGCAAA 
TCCAAACAATCACTCTITATCTCATCCAfi A A C M A t ACACCCCTTSTCCCTCCCACGTrCTCACACCACAMTCATCACATC L rtCPCTm fCA ACAAA 



340 



580 



400 



420 



440 



fTre rrr *« TTT* HC* tCtl ICCAATCAAAACTGCTTCAAjCATOGAAATGATTTTC^ 
TT T'V ftl M ^nrr^"^ k ^ AAMA mcAe ^ m ^ A ^ i ^ XLA ^ TTm 
TrnV * ^'TTAACCACC t^ATT^XAAACTtXTTTCAACATWCAATCATCCTC^ 
CTrrcAACAAACATTAACGACCAAO^TAAXATCn«rr^^ 
CTTGCAOOUUkCATTAACCAGCAAfiCAATtlAAAACTCGTT^ 

TTTTCAAGAAACA rTAACCACCAACGAAT^lAVuCCCTTTCAACA rCCAAATCATCTCTATTGACTAAXACACCAGTCCACAUl rC TATC AGTTCTOCCAT 

cttccaacaaagattaaccacca^xaaTCXaaac^^ 

CTTCCAAAAA I Hi rTAACCACC > A^ATTg^AAACTCCTTCATCATGGAAATCA 



460 



480 



700 



720 



740 



TCAAAGJCTC A1U 1 1 11 1 U. L ATCACCATCACAOCArTTAA A 1L 1 1 1 lU UATCTITTrAGCAGTATTAATCAACATrG XATTCAGCTCTTAAGCCACTA 
TTTTftflflfytl. U t i 1 1 It, I" 1 ****^*T0CTAlGAATTCAATCAAATCT5TCAAt/lUt 1 1 iLA GCACTCTTAAGCAACATS CTOTTCA CCTCTATSC 
TTT£JULMC A CTVVCTTrth T * f^ Aime **^ :nT n ™ A '^ A ' LA A ITTTCAAATGTTTTCAGCAG7CTAAACAAG n^lKC f tf f ATAgrrCTCCACCCACTAC 
TTTC AAACAiTrrrCACCtXTOCTATAJCT ATCACCA TCCTGA f AAACTCATTTATC TATTT AAATA TTTATff AACTATT ^ 
TgCCATTTCTCATTTTTCCTATAICCATCACATGAG^ t ! I ' ^ ACGAATGTTAACCA C CATCATGTTCACC^^ 

TTCAAACACTC AI I >L ILL 1A TAACCACCCCATCACTTCAATCAAAATTTTCAGATC7TTTC 

TTCAAATATTAA I rXTCC t ATATCATGACTtCACTTCAATCAAAA fTTTCAAACG [' I ICACAOi it 1 tAACCAACAOrrCT TTACC TCCACACCGACA 
Ti^UU>CACTCACTT t rrA7AACCACCAffAAGTrCAATCAA^ 



?40 



760 



300 



920 



140 



^f^^^i/^Af-^mttUffl^Tf^TTAra 

CCACTACrCCCTTACAGArCACCAIGCXCA tCCATCTA rTCArCTATTTATTTAA A TC 1 1 «' ATTTACTTAACTACTAr ACCCACTTAAA ITAL 111IGI1 
TrrTTT nr A C A Tn AfT A TTrrCAl' V I' i * I 1 - 1 " * " ' ! ' rCl I a. a. it ittt 1ATT *TTy~rTAA.iATTT XT^T kAT xT<\aTCA£X^GC rTTAflAT 

TCTTCATATAACGTCATGTGCACCTTTACA C IV 1 1 AGTCTAATAAAAC AtM ltd lA TATTTACTC-poly< A) 

CT ACTTCCTTACGGATC ATCATCCTCATGGATCTCTTT fCT iLU 1 itATCTATC 

CTACTCCrrTACACATCACCATCCICAtACAirrAATTATCrAW 

AAATCTTtACACATCATCATgCCAATCrATCTATTCTATCTATTTATCTA TC fCTCT TUTATCfAATCTATTTAAATATTTATTTATTTATAAGAT 
CTTT A CA C A Te A CC ATTCTC AIUlUlXU ICAICrATTTATTTAAATATTTATTTATTTAACTATTTTTATTATTTAAArTAT^ 



S60 



860 



900 



320 



940 



ATTACCTCATCTCCA CU 1 1 1 <JU C Mil\X, I tA ATCTAATAAAAT A 11*111. 1 1 III ATTTGGT - po ly(A> 

CATATTA^AlTATGTCAACTTTtACATTGTCAATtGTGTAAC^^ 

gSTCCI lA AICTAACAATAT AlU 1 1L I ILATATTTACCCAATATATTAA ilia 1 11 1 IC ATTAAATT TTTACT AT AC • po ly (A ) 
ATOTATTTTTACn rCTGCT I AAIA TA ACA A C ACATC; IU11 tA TATTTACTCAATATATTA Cl 1I W.U fTTCATTAAATTTTrACTATCC-' po 1 y U> 
ATPTCTACTTTTACATTCTCTTmTrAAAATATGTTAATCTAATATTT rtTICTTTl 1ATTAAT TTTT AC TATTAAA AGTTCTTAT 

TTAAATTATTTTAAaCTT AIU rfSCT rCA CGTAATATTACATCCACCTTA C I . tCTGGC f A ATATAA TAAAAT A iU i K i i [ATCTn rCTC -POly ( A) 
TCACTACCrrtACAX rCTCCl 1A ATG T A A C A AA TA ICTTCX tC ATATTTACCCAATATATTA AX I ILL till rCATTAAATTTTTACTAT- po 1 y ( A) 



340 



980 



1000 



AATICTTTJg 1 1 A TTCTTTAAAATTCAACTCCAACCCATCAATTCTCCAAACTGATTA If ( A) 

ATTATTTCCTtAITCTTr AATAAAGAAATTCCAAGCCC - po ly ( A ) 
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Xba 



Pst I Sau 3a 



Eco RI Sau 3a 

f 



Eco RI 



STOP 



Pst I 



110 bp 



132 bp 



GGCTGT GATCTG 



700 bp 




7a. 



HindJH EcoRI 
I 1 



Xba Eco RI Sau 3a 
— I 1 — 



-I 



Trp 




Le-IFA 

£ 



Pst I 



ATGTGT 



GATCTG 
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GTATSTTOCTA 
GTATSTTCCCTA 
STATSTTCCTTA 
GTATSTTOCTA 
GTATSTTCACTA 



TTTAAfiGC-TAfiGCACAMfiCAABinCTTCAfiAfiAACCTGG 

TTTAAGGC -TAG^ACaJuEkAAi^TCT^ AGAGAACCTGGAGCCTAAGGTTTASGCTCACCCATT -TCAACCAGTCTAfiCAGCATCTGCAACATCTACA 

mAASACCTATO^GASCAAOT 
TTTAAGSCCTATGCACAfiAfiCAAA6TCTTCASAAAACCTW 

mAASACCTATTOCAfiAfiCAAASTCTCCAfiAAAACCTA 



♦I 



100 



ATQSCCTTSACCTrTGCmACTG^ 

XTBSC C SlCC I I CI I f A CTGATSSCCSTGCTSGTCCTCAGCTACAAATCCATCTGnCTCT 

200 

6CAfi6AGfiACCTT6AT6CTCCTSGGACAGATSAfi6AGAA fCTCTCl 1 1 Jl yiCT6CTTSAASSACAfiACATGACTrTBSArrTCCCCAfiSAfiSAfiTTT«~ 
ACAGGAGGACTTTl^TIXTCATGGCA^ 

atafisagsgccttwtactcctggcacaaatggsaafiaatctcto 
ataoagggcc^ 

ATAGGAGGG^GATACra 



-GfiCAACCASTOCAAAAfiSaGAAACaTCCCTGTCCTCCATGASATG^ 
TGSCAACCASTTCCAGAAAGCTCAASCaTCTCTSTC^ 

tS^ttccacJ^ 

TeSCCACCAfimCABAASACTCAAaCATCrCTCTCCTCa^ 

tSaaccact 

400 

6ATSA6ACCCTCCTA6ACAMnCTACACT6AACTCTACCAaCAa 

GATGA^CCCTCCTASAAAMnCT^ 

GAACJU^TOTAGAAA^ 

SAACAGAGCCTCCTAGAAAMTmCCACTGAAC^ 

SAACAfiASCCTCCTAGAAAMTmCCACTSAMmACCAGCAACT 

500 

TGATGAAGGAGGACTCCATTCTGGCTGTGAGGAAATA^ 

T6AT GAATGAGGACTCCATCC TS GC TGTGAAGAAAT ACTTCCAAA6AAT CAC TCTnATCTGATGGAGAAGAMT ACAGCCCTTGTGCCT5GGAGSTT6T 

TGATGAATGAGGACTCCATCCTOiCTSTGAGGAAATACTTCCAAAGAATCAC 

TSATSAATSAGGACTTCATCCTSGCTBTGAGGAAATACTTCCAAAGMTCACTCTnATCW 

TGATGAA^^ 

600 

CAfiA6CA6AMTCAT6A6ATCTrrrTCTrT6TCAACAAACTTfi^ 
CAGAGCAGAAATCATGA6ATC CCTCTCTTTTTCAACAAACTTCCAAAAAAGAn 
OGAGCAGAMTCATGAGATCTCTCTCTTTTTCAj^ 
CAGAGCAGAMTCATGAGATCCHCTCTTmCAACAAtt 
CAGAGCAGAMTCATGAGATCCCTCTCGTTTTCAACAAACTT6 

7 00 

TSAnCGTATGCCAfiCTtACCTTTTTAT6ATCT6CCATTTCAAA6ACTW 
TGAC TAATACATCATCTGACAC TnCATSAGTT CnCCATTTCAAAGACTCA^ 
TGACTAATACAnATCTCACACrrrCATGAGTTCCTCCATTTCAAAGACT^ 

TGACTAATGCATCATCTCACAC TTTCATSAGTTCTTC CATTTCAAAGACTCACTTCTATAACCACCACAAfiTTCMTCAAAATTTC^ 
TBACTMTACATTATCTCACACTTTCATSAfiTTCTTCCATT^ 

800 

GA6TATTMTCAACATTGTATTCAGCTCTTAAG6CACTA6TCCC^ 
AGGAGTGTAAAGAAGCATCATGTATACCTGTGCAGGCACTAGra 
AGTSTAAAGAAGCGTCCTCTATACCTGTCCAGGCACTAGTACmACAW 
AtfltflTAAGAAGCATCSTSTTTACCTSTGCAGGCACTAGTCC^^ 

AGTCTAAAGAAGTCTCGTGTATACCTSTGCAGGCACTAfiTCCTnACA^ FTT6T1 1 AAATATTTATTTAATT 

900 

TTTATTTAACTATTTATAAAACAACTTA 1 1 1 1 1UI 1 C ATATTATGTCATGTGCACCTTTGCACAgTSSTTAATG TAATAAAATGT b I IUI i ibUTTTGG 

ATTTAACTATTTTTATTATnAMTTATTTTnATGTTAATATCXTGTSTACC 

TATTTTTAAfiATTTAAATTATTTTTTTATGTMTATWTBTCTA 

TAACTArrTTTArrATTTAAATTATTTTTTATGTAATATCATATGTACCTn 

ATTTTTAAAATnATGTMTATCATGAGTCGCTTT^ 

1.000 

TAAAmATTTTGTGTTGnCATOAACTTTTOT 

TATATTAA I I I CC I 1 1 1 1 t ATTAAATTTTTACTATACAAA Al I ICIUIUI i I bCTATTT 
MmCCTrmWnAMTTmACTATACA^ 

TTAATTTCCTTTTT CATTAAATTTTTACTATACAAAATTTCnGTGTTTGTTTA^ 
nAAATTTTTACTATACAAAATTTCTTGTGTTTGTTTATTCTnAAGA 



9/3 



zmm 



A 

H 
I 
J 
C 



SI 
H A 
M A 
M A 
M A 
M A 



L 
L 
L 
R 
L 



T F 

P F 

S F 

S F 

S F 



A 

S 
S 
S 
S 



L 
L 
L 
L 
L 



$10 
L V A L 
M M A L 
L M A V 
LHY V 
L M A Y 



L 
V 
L 
L 
L 



Y 
Y 
Y 
V 
V 



L 
L 
L 
L 
L 



S 
S 

s 
s 
s 



c 
c 

Y 
Y 
Y 



K 
K 
K 
K 
K 



S 
S 
S 
S 
S 



S20 

s c s 
s c s 
I c s 
I c s 
I * s 



V 
L 
L 
L 
L 



G 
G 
G 
G 
G 



1 
C 
C 

c 
c 
c 



D 
N 
0 
D 
D 



L 
L 
L 
L 
L 



P 
$ 
P 
P 
P 



0 
Q 
0 
0 
0 



T 
T 
T 
T 
T 



H 
H 
H 
H 
H 



S 

s 
s 
s 
s 



L 
L 
L 
L 
L 



10 
G S 



H 

G 
R 
G 



N 
N 
N 
N 



R 
R 
R 
R 
R 



R 
R 
R 
R 
R 



T 
T 
A 
A 
A 



L 
L 
I 
L 
L 



A 
H 
I 
J 
C 



M 
M 

I 
I 
I 



t 
L 
L 
L 
L 



L 
M 
L 
L 
L 



A 
A 
A 
A 
G 



20 
Q M 
0 M 
Q M 
Q M 
Q M 



R R I 
R Tf I 
R 
R 
R 



S 

s 
s 
s 
s 



L 
P 
P 
P 
P 



F 
F 
F 
F 
F 



S 
S 

s 
s 
s 



c 
c 
c 
c 
c 



30 
L K 



L 
L 
L 
L 



K 
K 
K 
K 



0 
D 
D 
0 
0 



R 
R 
R 
R 
R 



H 
H 
P 
H 
H 



D 
0 
0 
E 
0 



F 
F 
F 
F 
F 



G 
E 
G 
R 
R 



F 
F 
L 
F 
I 



P 
P 
P 
P 
P 



40 
0 E 



Q 
Q 
E 

Q 



E 
E 
E 
E 



E 
E 
E 
E 
E 



F 
F 
F 
F 
F 



0 
D 
D 
0 



G 
G 
G 
G 
G 



N 
N 
H 
H 
N 



Q 
Q 
0 
Q 
Q 



F 
F 
F 
F 
F 



0 
Q 
Q 
Q 
Q 



50 
K A 
K A 
K T 
K T 
K A 



E 

Q 
Q 
0 
Q 



T 
A 
A 
A 
A 



A 
H 
I 
J 
C 



I 
I 
I 
I 
I 



P 

s 
s 
$ 
s 



V 
Y 
Y 
V 
Y 



L 
L 
L 
L 
L 



H 
H 
H 
H 
H 



60 
E M I 
EMM 
E M I 
E M I 
E M I 



Q 

Q 
0 
Q 
Q 



Q 
Q 
Q 
Q 
0 



I 

T 
T 
T 
T 



F 
F 
F 
F 
F 



N 
N 
H 
H 
H 



I 
L 
I 
L 
L 



F 
F 
F 
F 
F 



S 
$ 
S 
S 
S 



70 
T 
T 
T 
T 
T 



K 
K 
E 
E 
E 



0 
H 
0 
0 
0 



S 
S 
S 
S 

s 



$ 

S 

s 
$ 

S 



A 
A 
A 
A 
A 



A 
A 
A 
A 
A 



W 

w 

w 

M 
W 



D 
0 
E 
E 
E 



E 
E 

Q 
Q 
Q 



80 
T I 
T L 
S L 
S L 
S L 



L 
L 
L 
L 
L 



0 K 
E K 
E K 
E K 
E K 



F 
F 
F 
F 
F 



Y 
Y 
S 
S 
S 



T 
I 
T 
T 
T 



E 
E 
E 
E 
E 



L 
L 
L 
L 
I 



90 

Y Q 
F 0 

Y Q 

Y Q 

Y Q 



A 
H 
I 
J 
C 



Q 
Q 
Q 
0 
Q 



L 
M 

L 
L 
L 



H 
H 
N 
H 
H 



D 
0 
N 
D 
D 



L 
L 
L 
L 

L 



E 
E 



A 
A 



E A 
E A 



C 
C 

c 

A C 



100 
C Y I 
Y 
Y 
V 
Y 



I 
I 
I 
I 



Q 
Q 
Q 
Q 
Q 



G 
E 
E 
E 
E 



Y 
Y 
V 
Y 
V 



G 
G 
G 
G 
G 



V 
Y 
M 
V 
Y 



T 
E 
E 
K 
E 



E 
E 
E 
E 
E 



110 
T P L 
T P 
T P 
T P 
T P 



L 
L 
L 
L 



M 
M 
M 
M 
M 



K 
H 
N 
N 
H 



E 
E 
E 
E 
E 



0 
0 
0 
0 
0 



S 

s 
s 

F 

S 



I 
I 
I 
I 
I 



L 
L 
L 
L 
L 



120 
AYR 
A Y 
A Y 
A Y 
A Y 



R 
R 
R 
R 



K 
K 
K 
K 
K 



Y 
Y 
Y 
Y 
Y 



F 
F 
F 
F 
F 



Q 
Q 
Q 
Q 
Q 



R 
R 
R 
R 
R 



I 
I 
I 
I 
I 



T 
T 
T 
T 
T 



L 
L 
L 
L 
L 



A 
H 
I 
J 
C 



130 

Y L 

Y L 

Y L 

Y L 

Y L 



K 
M 
T 
M 

I 



E 
E 
E 
E 
E 



K 
K 
K 
K 
R 



K 
X 
K 
K 
K 



Y 
Y 
Y 
Y 
Y 



S 
S 
S 

s 
s 



p 
p 
p 
p 
p 



140 
CAW 
A W 
A W 
A W 
A W 



E 
E 
E 
E 
E 



V 
Y 
Y 
Y 
Y 



Y 
V 
Y 
V 
Y 



R 
R 
R 
R 
R 



A 
A 
A 
A 
A 



E 
E 
E 
E 
E 



I 



150 
I M R S 
MRS 
I M R S 
I M R S 
I M R S 



F S 
L S 
US 
F S 
L S 



L 
F 
F 
F 
F 



S 
S 
S 
S 
S 



T 
T 
T 
T 
T 



N 
N 
H 
H 
H 



L 
L 
L 
L 
L 



160 
Q E S 
Q K R 
Q K 1 
KK6 

q k r 



L 
I 
L 
L 
L 



R 
R 
R 
R 
R 



S 
R 
R 
R 
R 



K 
K 
K 
K 
K 



E 
0 
D 
0 
D 



1 



GB 2 079 291 A 1 



SPECIFICATION 

Interferons and Process for Their Preparation 

The present invention relates to the field of recombinant DNA technology, i.e. to processes used 
in recombinant DNA technology and to products obtained by these processes. 
5 In a more detailed aspect the present invention relates to polypeptides, specifically to mature 5 

human leukocyte interferons, to pharmaceutical com positions containing them and to a process for 
their preparation which comprises causing a culture of a microorganism transformed with a replicable 
microbial expression vehicle capable of expressing said polypeptides to grow up and express said 
polypeptides. The present invention also comprises the expression vehicles used in this process and the 
1 0 novel microorganisms containing these expression vehicles as well as the processes for their 1 0 

preparation. Finally, the invention relates to DNA sequences comprising sequences coding for the 
amino acid sequence of a mature human leukocyte interferon. 

Background of the Invention 

Human leukocyte interferon (LelF) was first discovered and prepared in the form of very crude 

1 5 precipitates by Isaacs and Lindenmann (Proc R. Soc. B. 1 47, 258 — 267 [1 957]; U.S.P. 3.699.222). 1 5 
Efforts to purify and characterize the material have been ongoing since that time, and have led to the 
preparation of relatively homogeneous leukocyte interferons derived from normal or leukemic donors* 
leukocytes {German Offenlegungsschrift No. 2.947.1 34). These Interferons are a family of proteins 
known to possess the ability to confer a virus-resistant state in their target cells. In addition , interferon 

20 can act to inhibit ceil proliferation and modulate immune response. These properties have prompted 20 
the clinical use of leukocyte interferon as a therapeutic agent for the treatment of viral infections and 
malignancies. 

Leukocyte interferons have been purified to essential homogeneity (Rubinstein et al., Proc. Natl. 
Acad. Sci. U.S.A. 76. 640—644 [1 979]; Zoon et al.. ibid. 76. 5601—5605 [1 979]), and reported 

25 molecular weights range from about 1 7,500 to about 21 ,000. The specific activity of these 25 
preparations is remarkably high, 2 x 1 0 8 to 1 x 1 0 9 units/mg protein, but yields from cell culture 
methods have been discouraging ly low. Nevertheless, advances in protein sequencing techniques have 
permitted the determination of partial amino acid sequences (Zoon et al.. Science 207. 527 [1 980]; 
Levy et al., Proc. Natl. Acad. Sci. U.S.A. 77, 5102 — 51 04 [1 980]). Elucidation of the glycosylation of 

30 various leukocyte interferons is not at present complete, but it Is now clear that differences in 30 
glycosylation among family members does not alone account for the spectrum of molecular weights 
observed. Instead, the leukocyte Interferons differ markedly in amino acid composition and sequence, 
and amino acid homology Is, in some cases, less than 80 percent 

While isolation from donor leukocytes has provided sufficient material for partial characterization 

35 and limited clinical studies with homogeneous leukocyte Interferon, it is a totally inadequate source for 35 
the amounts of interferon needed for large scale clinical trials and for broad scale prophylactic and/or 
therapeutic use thereafter. Indeed, presently clinical investigations employing human leukocyte- 
derived interferons in antitumor and antiviral testing have principally been confined to crude (<1 
percent pure) preparation of the material, and long lead times for the manufacture of sufficient 

40 quantities, even at unrealistic price levels, have critically delayed investigation on an expanded front 40 
With the advent of recombinant DNA technology, however, the controlled microbial production of 
an enormous variety of useful polypeptides has become possible. Already in hand are bacteria modified 
by this technology to permit the production of such polypeptide products such as somatostatin, the A 
and B chains of human insulin and human growth hormone (Itakura et al., Science 198. 1056 — 1063 

45 [1 977]; Goeddel et al., Nature 281. 544 — 548 [1 979]). More recently, recombinant DNA techniques 45 
have been used to occasion the bacterial production of prbinsulin and thymosin alpha 1 and several 
authors have reported on the obtention of DNA coding for human leukocyte interferon and to resultant 
proteins having leukocyte Interferon activity (Nagata et al.. Nature 284. 316 — 320 [1 980]; Mantel et 
al.,Gene 10. 1—10 [1980]; Taniguchi etal.. Nature 285. 547 — 549[1980]). 

50 The workhorse of recombinant DNA technology is the pfasmid, a non-chromosomal loop of 50 

double-stranded DNA found In bacteria and other microbes, oftentimes In multiple copies per cell. 
Included In the information encoded In the plasmid DNA is that required to reproduce the plasmid in 
daughter cells (i.e., a "repllcon") and ordinarily, one or more selection characteristics such as, in the 
case of bacteria, resistance to antibiotics which permit clones of the host cell containing the plasmid of 

55 interest to be recognized and preferentially grown in selective media. The utility of plasmids lies in the 55 
fact that they can be specifically cleaved by one or another restriction endonuclease or "restriction 
enzyme", each of which recognizes a different site on the plasmidic DNA. Thereafter heterologous 
genes or gene fragments may be Inserted into the plasmid by endwise joining at the cleavage site or at 
reconstructed ends adjacent to the cleavage site. DNA recombination is performed outside the cell, but 

60 the resulting "recombinant" plasmid can be introduced into it by a process known as transformation 60 
and large quantities of the heterologous gene-containing recombinant plasmid are obtained by 
growing the transformant Moreover, where the gene is properly inserted with reference to portions of 
the plasmid which govern the transcription and translation of the encoded DNA message, the resulting 




expression vehicle can be used to actually produce the polypeptide sequence for which the inserted 
gene codes, a process referred to as expression. 

Expression is initiated in a region known as the promoter which is recognized by and bound by 
RNA polymerase. In some cases, as in the trytophan or "trp" promoter preferred in the practice of the 
5 present invention, promoter regions are overlapped by "operator" regions to form a combined 5 
promoter-operator. Operators are DNA sequences which are recognized by so-called repressor proteins 
which serve to regulate the frequency of transcription initiation at a particular promoter. The 
polymerase travels along the DNA, transcribing the information contained in the coding strans from its 
5' to 3' end into messenger RNA which is in turn translated into a polypeptide having the amino acid 

1 0 sequence for which the DNA codes. Each amino acid is encoded by a nucleotide triplet or "codon" 1 0 

within what may for present purposes be referred to as the "structural gene", i.e. that part which 
encodes the amino acid sequence of the expressed product. After binding to the promoter, the RNA 
polymerase first transcribes nucleotides encoding a ribosome binding site, then a translation initiation 
or "start" signal (ordinarily ATG, which in the resulting messenger RNA becomes AUG), then the 

1 5 nucleotide codons within the structural gene itself. So-called stop codons are transcribed at the end of 15 
the structural gene whereafter the polymerase may form an additional sequence of messenger RNA 
which, because of the presence of the stop signal, will remain untranslated by the ribosomes. 
Ribosomes bind to the binding site provided on the messenger RNA. in bacteria ordinarily as the mRNA 
is being formed, and themselves produce the encoded polypeptide, beginning at the translation start 

20 signal and ending at the previously mentioned stop signal. The desired product is produced if the 20 
sequences encoding the ribosome binding site are positioned properly with respect to the AUG 
initiation codon and if ail remaining codons follow the initiation codon in phase. The resulting product 
may be obtained by lysing the host cell and recovering the product by appropriate purification from other 
bacterial protein. 

25 We perceived that application of -recombinant DNA technology (i.e. the insertion of interferon 25 

genes in microbial expression vehicles and their expression under the control of microbial gene 
regulatory elements) would be the most effective way of providing large quantities of leukocyte 
interferon which, despite the absence in material so produced of the glycosylation characteristic of 
human-derived material, could be employed clinically in the treatment of a wide range of viral and 

30 neoplastic diseases. 30 
The approach to obtaining a first leukocyte gene in a accordance with the present Invention 
involved the following steps: 

(1) Partial amino acid sequences of human leukocyte interferon purified to homogeneity were 
used to construct sets of synthetic DNA probes the codons of which, "in the aggregate, represented all 

35 possible nucleotide combinations capable of encoding the partial amino acid sequences. 35 

(2) Bacterial colony banks were prepared containing complementary DNA (cDNA) from induced 
messenger RNA. Other induced mRNA having been radio-labelled was hybridized to plasmid cDNA 
from this bank. Hybridizing mRNA was eluted and tested for translation Into interferon in oocyte assay. 
Plasmid DNA from colonies shown to induce interferon activity in this manner have further been tested 

40 for hybridization to probes made as described in (1 ) above. 40 

(3) Parallel to the approach In part (2) above, induced mRNA-derived cDNA In plasmids were 
used to form an independent bank of transfbrmant colonies. The probes of part ( 1 ) were used to prime 
the synthesis of radio-labelled single stranded cDNA for use as hybridization probes. The synthetic 
probes hybridized with induced mRNA as template and were extended by reverse transcription to form 

45 induced, radio-labelled cDNA. Clones from the colony bank that hybridized to radio-labelled cDNA 45 
obtained in this manner have been investigated further to confirm the presence of a full-length 
interferon encoding gene. Any partial length putative gene fragment obtained In parts (1) or (2) can 
itself be used as a probe for the fulMength gene. 

(4) The full-length gene obtained above was tailored, using synthetic DNA, to eliminate any 

50 leader sequence that might prevent microbial expression of the mature polypeptide and to permit 50 
appropriate positioning in an expression vehicle relative to start signals and the ribosome binding site 
of a microbial promoter. Expressed interferon was purified to a point permitting confirmation of its 
character and determination of its activity. 

(5) The interferon gene fragment prepared in the foregoing fashion was itself used in probing, by 

55 hybridization, for other partially homologous leukocyte interferon species. 55 
In applying methods of recombinant DNA technology as outlined above the microbial production 
in high yield and purity of the family of homologous leukocyte interferons (unglysosylated) as mature 
polypeptides, essentially unaccompanied by the corresponding presequence or any portion thereof, 
was achieved. These interferons may be directly expressed, recovered and purified to levels fitting them 

60 for use in the treatment of viral and malignant diseases of animals and man. Family members so far 60 
expressed have proven efficacious in in vitro testing and, in the first such demonstration of its kind, in in 
vivo testing as well, the latter involving the first mature leukocyte interferon to have been microbially 
produced. 

The expression "mature leukocyte irrterferon" used in "the context of the present application 
65 defines a microbially (e.g. bacterialry) produced interferon molecule, devoid of glycosyl groups. Mature, 65 
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leukocyte interferon, according to the present invention, is immediately expressed from a translation 
start signal (ATG) just before the first amino add codon of the natural product. The "mature" 
polypeptide may thus contain as the first amino acid in its sequence methionine (for which ATG codes) 
without essentially altering its character. On the other hand the microbial host may process the 
5 translation product to delete the initial methionine. Mature leukocyte interferon could be expressed 5 
together with a conjugated protein other than the conventional leader, the conjugate being specifically 
cieavable in an intra- or extracellular environment (see British Patent Publication No. 2007676A). 
Finally, the mature interferon could be produced in conjunction with a microbial "signal" peptide which 
transports the conjugate to the cell wall, where the signal is processed away and the mature 

1 0 polypeptide secreted. "Expression" of mature leukocyte interferon connotes the bacterial or other 1 0 
microbial production of an interferon molecule containing no gfycosyl groups or a presequence that 
immediately attends mRNA translation of a human leukocyte interferone genome. 

Particular leukocyte interferon proteins hereof have been defined by means of determined DNA 
gene (Figures 3 and 8) and deductive amino acid sequencing (Figures 4 and 9). It will be understood 

1 5 that for these particular interferons, indeed all of the family of leukocyte interferon proteins embraced 1 5 
herein, natural allelic variations exist and occur from individual to individual. These variations may be 
demonstrated by (an) amino acid difference(s) in the overall sequence or by deletions, substitutions, 
insertions, inversions or additions of (an) amino acid(s) in said sequence. For each leukocyte interferon 
protein hereof, labelled LeIF A through LeIF J such allelic variations are included within the scope of the 

20 label or term defining such, and thus, this invention. 20 

Description of the Drawings 

Figure 1 depicts two amino add sequences which were common to all Interferon species isolated 
from human leukocyte and purified to homogeneity designated T — 1 and T — 1 3. All potential mRNA 
sequences coding for these peptides are shown, as are the corresponding DNA sequences. The letters 

25 A, T, G, C and U respectively connote the nucelotides containing the bases adenine, thymine, guanine, 25 
cytosine and uracil. The letter N connotes any one of the nucleotides A. G, C and U. Polynucleotides are 
depicted as reading from the 5' (left) in the 3' (right) direction and, where double stranded ("d.s.") DNA 
is depicted vice-versa for the bottom or non-coding strand. 

Figure 2 is an autoradiogram showing hybridization of potential LeIF plasmids with ^P-labelled 

30 synthetic deoxyoligonucleotides. 30 
Figure 3 depicts the nucleotide sequences (coding strands) of eight gene fragments isolated as 
candidates for use in the expression of leukocyte interferons, respectively designated "A" through "H". 
The ATG translational initiation codon and the termination triplet for each LeIF is underlined. The stop 
codons or termination triplets are followed by 3' untranslated regions. The included full-length gene for 

35 LeIF A is missing one codon found in the others depicted, as indicated in the third A fine of Figure 3. 5' 35 
untranslated regions precede the leader sequences. As isolated, fragment E lacks the full presequence 
of leader, but includes the entire gene for the putative mature LeIF E. Fragment G as Isolated lacks the 
full coding sequence. 

Figure 4 is a comparison of the eight LeIF protein sequences predicted fron nudeotfde sequences. 

40 The one letter abbreviations recommended by the IUPAC-IUB Commission on Biochemical 40 
Nomenclature are used: A, alanine; C, cysteine; D, aspartic acid; E, glutamic acid; F, phenylalanine; G, 
glycine; H, histidine; I, Isoleucine; K, lysine; U leudne; M, methionine; N, asparagine; P, proline; Q, 
glutamine; R, arglnlne; S, serine; T, threonine; V, valine; W, tryptophan; and Y, tyrosine. The numbers 
refer to amino acid positions (S refers to signal peptide). The dash in the 1 65 amino acid LeIF A 

45 sequence at position 44 is introduced to align the LeIF A sequence with the 1 66 amino add sequences 45 
of the other LeIFs. The LeIF E sequence was determined by ignoring the extra nucleotide (position 1 87 
of Figure 3) in its coding region. The asterisks indicate in-phase termination codons. Amino acids 
common to all LeIFs (excluding the pseudogene LeIF E) are also shown). The underlined residues are 
amino adds which are also present in human fibroblast interferon. 

50 Figure 5 depicts restriction endonuclease maps of the eight types of LeIF cloned cDNAs (A 50 

through H). The hybrid plasmids were constructed by the dCrdG tailing method (Goeddel, D.V. et at. 
Nature 287, 41 1— - 41 6 [1 980]). Therefor, the cDNA inserts can be exised using Pst I. The lines at the 
end of each cDNA insert represents the flanking homopolymeric dC:dG tails. The positions of Pvull, 
EcoRI and Bgl II restriction sites are indicated. Shaded regions of the figure represent the coding 

55 sequences of mature LeIFs; the cross-hatched regions indicate signal peptide coding sequences; and 65 
the open regions show 3' and 5' non-coding sequences. 

Figure 6 schematically depicts the construction of a gene coding for the direct microbial synthesis 
of mature LeIF A. Restriction sites and residues are as shown ("Pst I", etc.). The term "b.p ." connotes 
"base pair". 

60 Figure 7 (not to scale) schematically depicts a restriction map of two gene fragments employed in 60 

expressing the mature leukocyte interferon LeIF B. The codon sequences Indicated are the coding 
strand termini resulting from digestion with the restriction enzyme Sau3a In the two cases shown. 

Figures 8 and 9 provide the DNA and amino acid (see Figure 4 above for the corresponding one 
letter abbreviations) sequences of five LeIF proteins hereof, including types 1 and J. In Figure 9, the 
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asterisk indicates a termination codon in the corresponding DNA sequence and the hyphen a deletion 
or gap in the sequence. 

Description of the Preferred Embodiments 
A. Microorganisms Employed 
5 The work described involved use of two microorganisms: E. Coll x 1 776, as described in U.S.P. 5* 

4.1 90.495. and E coli K-1 2 strain 294 (end A, thr, hsr, hsmt), as described in British patent 
publication No. 2055382 A. Each has been deposited with the American Type Culture Collection 
(ATCC accession Nos. 3 1 537 and 3 1 446 respectively). All recombinant DNA work was performed in 
compliance with applicable guidelines of the National Institutes of Health. 

10 The invention, in its most preferred embodiments, is described with reference to E. coli, including 10 

not only strains E. colix 1 776 and E. coli K-1 2 strain 294, defined above, but also other known E. coli 
strains such as E. coli B, or other microbial strains, many of which are deposited and available from 
recognized microorganism depository institutions, such as the American Type Culture Collection. See 
also German Offenlegungsschrift 2644432. These other microorganisms include, for example. Bacilli 

1 5 such as Bacillus subtilis and other enterobacteriaceae among with can be mentioned as examples 1 5 

Salmonella typhimurium and Serratia marcescens, utilizing plasmids that can replicate and express 
heterologous gene sequences therein. Yeast, such as Saccharomyces cerevisiae, may also be employed 
to advantage as host organism in the preparation of the Interferon proteins hereof by expression of 
genes coding therefor under the control of a yeast promoter. 

20 B. Source and Purification of LeIF and mRNA 20 
LeIF mRNA may be obtained from human leukocytes, ordinarily those of patients with chronic 
myelogenous leukemia, that have been induced to produce interferon with Sendai or Newcastle 
disease virus, as described in e.g. German Offenlegungsschrift No. 2947 1 34. A particularly preferred 
source, and that used in the work reported herein, is a cell line designated KG-1 derived from a patient 

25 with acute myelogenous leukemia. The cell line, described by Koeffler, H. P. and Golde, D. W., Science 25 * 
200. 1 1 53 (1978), grows readily in a culture medium comprising RPMI (Rosewell Park Memorial 
Institute) 1 640 plus 1 0% FCS (fetal calf serum) heat-inactivated, 25 mM HEPES buffer (N-2- 
hydroxyemyl-piperazine-N'-2-ethane-sulfonic acid) and 50 ^g/ml of gentamicin, and is subcultured 1 
to 3 split two times a week. Cells may be frozen from the foregoing growth medium plus 1 096 

30 dimethylsulfoxide. KG-1 has been deposited with the American Type Culture Collection (ATCC 30 
accession No. CRL 8031 ). 

KG-1 cells were induced to produce leukocyte Interferon mRNA with Sendai or Newcastle 
disease virus following the procedure described by Rubinstein et al. (Proc. Natl. Acad. Scl. U.S.A. 76 
640 — 644 [1 979]). Cells were harvested 5 hours after induction and RNA prepared by the guanidine 

35 thiocyanate-guanidine hydrochloride procedure (Chirgwin et al.. Biochemistry 18. 5294 — 5299 35 
[1 979)). RNA from uninduced cells was Isolated in the same manner. Oligo deoxythymidin (dT) — 
cellulose chromatography and sucrose gradient ultracentrifugation was used to obtain the 1 2S fraction 
of poly (A) mRNA as described by Green et al. (Arch. Biochem. Biophys. 7 72. 74 — 89 [1 976]) and 
Okuyuma et al. (Arch. Biochem. Biophys. 188. 98 — 104 [1 978]). This mRNA had an interferon titer of 

40 8000 — 1 0,000 units per microgram in the Xenopus laevis oocyte assay (Cavalieri et al., Proc. Natl. 40 
Acad. Sci. U.S.A. 74. 3287—3291 [1977]). 

C. Preparation of Colony Banks Containing LeIF cDNA Sequences 

5 fxg of mRNA was used to prepare double stranded cDNA by standard procedures (Wickens et al., 
J. Biol. Chem. 253. 2483—2495 [1 978] and Goeddel et al„ Nature 281. 544 — 548 [1 979]). The 

45 cDNA was size fractionated by electrophoresis on a 6% poiyacrylamide gel and 230 ng of material 45 
ranging in size from 500 to 1 500 b.p. were recovered by electroelution. A 1 00 ng portion of this cDNA 
was tailed with deoxycytidin (dC) residues as described by Chang et al., Nature 275. 617 — 624 
(1 978), annealed with 470 ng of plasmid pBR322 which has been tailed with deoxy'guanosin (dG) 
residues at the Pstl site (Bolivar et al.. Gene 2. 95 — 1 1 3 [1 977]), and used to transform E. colix 1 776. 

50 Approximately 1 30 tetracycline resistant, ampicillin sensitive transformants were obtained per ng of 50 
cDNA. 

In a second similar experiment, approximately 1000 tetracycline resistant, ampicillin sensitive E. 
coli K-1 2 strain 294 transformants were obtained per ng of cDNA. In this case size fractionated cDNA 
material ranging in size from 600 to 1 300 b.p. was recovered by electroelution for dC tailing. 

55 D. Preparation of Synthetic Oligonucleotides and Their Use 55 
The knowledge of the amino acid sequences of several tryptic fragments of human leukocyte 
interferon permitted the design of synthetic deoxyoligonucieotides complementary to different regions 
of LeIF mRNA. The two tryptic peptides T1 and T1 3 were selected because they had amino acid 
sequences requiring the synthesis of only 1 2 and 4 undecamers, respectively, to account for all 

60 .possible coding sequences (Figure 1 .). Four sets of deoxyoligonucleotide probes were synthesized for 60 
each sequence, containing either three fT-1 A, B, C, D) or one (T-13A, B, C, D oligonucleotide each. The 
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indicated complementary deoxyoiigonucJeotides 1 1 bases long were chemically synthesized by the 
phosphotriester method (Crea et al„ Proc. Natl. Acad. Sci. U.S.A. 75, 5765—5769 [1 978]). Four 
individual probes were prepared in the T-1 3 series. The twelve T-1 probes were prepared in four pools 
of three probes as shown in Figure 1 . 
5 The four individual probes of the T-1 3 series and the twelve T-1 probes prepared in four pools of 5 

three primers each were used to prime the synthesis of radiolabelled single stranded cDNA for use as 
hybridization probes. The template mRNA was either the 12S RNA from SendaWnduced KG-1 cells 
(8000 units IF activity per fig) or total poly (A) mRNA from uninduced leukocytes «10 units per fig). 
^p-labelled cDNA was prepared from these primers using known reaction conditions (Noyes et al„ 

1 0 Proc. Natl. Acad. Sci. U.SA 76, 1 770—1 774 [1 979]). The 60 pi reactions were performed in 20 mM 10 
Tris-HCI (pH 8.3). 20 mM KCI, 8 mM MgC^, 30 mM /3-mercaptoethanol. Reactions Included one fig of 
each primer (i.e. 1 2 fig total for T-1 series. 4 fig total for T-1 3 series), 2 fig of "induced" 1 2S fraction 
mRNA (or 1 0 fig of uninduced poly (A) mRNA), 0.5 mM dATP. dCTP, oTTP, 200 /iCi (a^P) dCTP 
(Amersham, 2 — 3000 Ci/m'mole) and 60 units reverse transcriptase (Bethesda Research Laboratories). 

1 5 Product was separated from unincorporated label by gel filtration on a 1 0 ml Sephadex* G-50 column, \ 5 
treated with 0.3N NaOH for 30 minutes at 70°C to destroy RNA, and neutralized with HCI. 
Hybridizations were performed as described by Kafatos et aL, Nucleic Acids Res. 7, 1 541 — 1 552 
(1979). 

E. Identification of Clones pL1 — pL30 
20 The rapid plasmid isolation procedure of Bimboim et al., Nucleic Acids Res. 7, 1 51 3 — 1 523 ZQ 

(1979) was used to prepare 1 fig of plasmid DNA from each of 500 individual E coli K-1 2 strain 294 

transformants (see C). Each DNA sample was denatured and applied to nitrocellulose filters in triplicate 

following the procedure of Kafatos et aL (see above). 

The three sets of nitrocellulose filters containing the 500 plasmid samples were hydridized with 
25 a) induced cDNA primed with the T-1 set of primers, 25 

b) T-1 3 primed induced cDNA, and ^ 

c) uninduced cDNA prepared by using both sets of primers. Clones were considered positive if 
they hybridized more strongly to one or both of the induced cDNA probes than to the total uninduced 
probe. Thirty "positive" clones (pL1 — pL30) were selected from the 500 for further analysis. 

30 F. Identification of Clones pL31 — pL39 Isolation of a Plasmid (No. 104) Containing a LeIF Gene 30 
Fragment 

Transformants of E. Colix 1 776 were screened by the colony hybridization procedure of 
Grunstein and Hogness (Proc. Natl. Acad. Sci. U.S.A. 72, 3961—3965 [1975]) using ^P-labelled 
induced mRNA as probe (Lillenhaug et aU Biochemstry. 15. 1 858 — 1 865 [1 976]). Unlabeled mRNA 
35 from uninduced cells was mixed with the probe at a ratio of 200 to 1 to compete with uninduced 35 
mRNA present in the M P-labelled preparation. Hybridization of labelled mRNA should occur 
preferentially to colonies containing induced sequences. Three classes of transformants were obtained: 

( 1 ) 2 — 3% of the colonies hybridized to "P-mRNA very strongly, 

(2) 1096 hybridized significantly less than class 1 , and 

40 (3) the remainder gave no detectable hybridization signal. 40 

The positive colonies (classes (1 ) and (2)) were examined for the presence of interferon-specific 
sequences by an assay which depends upon hybridization of interferon mRNA specifically to plasmid 
DNA. Initially, 60 strong positive colonies (class 1 ) were grown individually In 1 00 ml of M9 medium 
supplemented with tetracycline (20 fig/m\). diaminopimelic acid (100/*g/ml), thymidine (20/ig/ml), 

45 and d-biotin ( 1 fig/m\). The M9 medium contains per liter Na a HP0 4 (6 g), KHJP0 A (3 g), NaCI (0.5 g) and 45 
NH 4 CI (1 g). After autoclaving 1 ml of sterile 1 M MgS0 4 and 10 ml of sterile 0.01 M CaCI 2 are added. 
Ten cultures were pooled and plasmid DNA was Isolated from the six pools as described by Clewell et 
al.. Biochemistry 9, 4428 — 440 [1 970]. Ten fig of each plasmid DNA pool were cleaved with Hindlli, 
denatured and covalently bound to DBM (diazobenzyloxymethyl) paper. One fig of purified mRNA from 

50 induced cells was hybridized to each filter. Unhybridized mRNA was removed by washing. The 50 
specifically hybridized mRNA was eluted and translated in Xenopus laevis oocytes. By this assay, all six 
pools were negative. Five pools of ten colonies each and one pool of nine colonies were made from 59 
weakly positive colonies, (class 2) and plasmids were prepared from the pools and examined as above. 
Among the six pools tested, one (K1 0) hybridized to interfon mRNA at levels significantly above 

56 background levels each time it was tested. In order to identify the specific interferon cDNA clone 55 
plasmid DNAs were prepared from the 9 colonies of pool K1 0 and examined individually. Two of the 
nine plasmids (No. 101 and No. 1 04) bound interferon mRNA well above background levels. From 
plasmid No. 1 04 a unique Bgl II restriction fragment containing 260 b.p. was isolated, labelled with "P 
using the procedure described by Taylor et at., Biochim. Biophys. Acta 442, 324 — 330 ( 1 976), and 

60 used as probe to independently screen 400 E. coli 294 transformants by an In situ colony screening 60 
procedure (Grunstein and Hogness, (Proc. Natl. Sci. U.S.A. 72, 3961—3965 [1 975]). Nine colonies 
(pL31 — pL39) were identified which hybridized to different extents with this probe. 

In addition, the labelled 260 b.p. fragment was used to Independently screen 4000 E. coll 294 
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transformants in the same manner. 50 colonies were identified which hybridized to different extents 
with this probe. One contained the LeIF G fragment, one contained the LeIF H fragment, and one 
contained a fragment designated LeIF H1 , an apparent allele of LeIF H. The hybrid plasmids which 
result are designated "pLelF H"\ etc. 

5 G. Isolation and Sequencing of a First FuIMength LeIF Gene 5 

o*« u Pla lT. i ? DNA was prepared from a " 39 Potential LeIF cDNA clones and rescreened with the same 
260 b.p. DNA probe using the hybridization procedure of Kafatos et al. (see above). Three plasmids 
(pL4, pL31, pL34) gave very strong hybridization signals, four (pL13, pL30, pL32, pL36) hybridized 
moderately, and three (pL6, pL8. pL1 4) hybridized weakly with the probe. 
1 0 The 39 potential LeIF cDNA recombinant plasmids were also screened by using 32 P-labei!ed 1 0 

synthetic! iindecamers (individual T-1 primer pools or individual T-1 3 primers) directly as hybridization 
probes. The hybridization condraons were chosen such that perfect base pairing should be required for 
detectable hybridization signals (Wallace et al.. Nucleic Acids Res. 6, 3543 — 3557 [1 979]). Thus 
plasmid DNA from the 39 clones was prepared by a standard cleared lysate procedure (Clewell et al 
15 see above) and purified by Biorad Agarose A-50 column chromatography. Samples O/igJofeach " 15 
preparation were linearized by treatment with Eco Rl, denatured in alkali and spotted on 2 separate 
nitrocellulose filters, 1 .5 fig per spot (Kafatos et al., see above). Individual synthetic 
deoxyoligonudeotide primers and primer pools were phosphorylated with (y»P) ATP as follows* 50 
pmoles of oligonucleotide and 100 pmoles of (p^PJATP (New England Nuclear, 2500 Ci/mmole) were 
20 combined in 30 of 50 mM Tris-HCI, 1 0 mM MgCl a 1 5 mM 0-mercaptoethanol. 2 units of T4 20 
polynucleotide kinase were added and, after 30 minutes at 37°C, ^P-labelled primers were purified by 
chromatography on 10 ml Sephadex* G-50 columns. Hybridizations were performed using 10* cpm or 
primer T-1 3C or 3 x 1 0 8 cpm of primer pool T-1 C. The hybridizations were performed at 1 5°C for 1 4 
hours in 6xSSC [1 xSSC=0.15 M NaCl, 0.01 5 M sodium citrate, pH 7.2], 1 0xDenhardt's [0.2% bovine 
25 serum albumine, 03% polyvinyfpyrolidone, 0.2% Ficoll] solution, as described by Wallace et al. (see 25 
above). Filters were washed far 5 minutes (3 times) at 0°C in 6xSSC, dried, and exposed to x-ray film 
Results are shown in Fig. 2 for^P-prfmer pool T-1 3C and primer T-1 C. 

Plasmid DNA from clone 104 was found to give significant hybridization with primer pool T-1 C 
and primer T-1 3C, but no detectable hybridization with the other undecamers. As shown in Rgure 2, 
30 several of the 39 potential LeIF plasmids (pL2, 4, 1 3, 1 7, 20, 30, 3 1 , 34) also hybridized with both of 30 
these probes. However, restriction analysis showed that only one of these plasmids, pL3 1 , also 
contained a 260 b.p. internal Bgfll fragment. Pstl digestion of pL31 showed the size of the cDNA insert 
to be approximately 1 000 b.p- 

The entire Pstl insert of pL31 was sequenced by both the Maxam-Gilbert chemical method 
35 (Methods Enzymol. 65, 499—560 [1 980]) and by the dideoxy chain termination procedure (Smith, 35 
Methods Enzymol. 65, 560—580 [1 980]) after subcloning Sau3a fragments into an M 1 3 vector. The 
DNA sequence is shown ("AT in Figure 3. The appropriate translational reading frame could be 
predicted from protein sequence information in hand, the known range of LeIF molecular weights, and 
the relative Incidence of stop triplets in the three possible reading frames, and that In turn permitted 
40 prediction of the entire LeIF amino acid sequence, including a pre- or signal peptide. The first ATG 40 
translational initiation codon is found 60 nucleotides from the 5' end of the sequence and is followed, 
1 88 codons later, by a TGA termination triplet; there are 342 untranslated nucleotides at the 3' end, ' 
followed by a pory (A) sequence. The putative signal peptide (presumably involved in the secretion of 
mature LeIF from leukocytes) is 23 amino acids long. The 1 65 amino acids constituting the mature LeIF 
45 have a calculated MW of 1 9,390. We have termed the LeIF encoded by pL3 1 "LeIF A." It can be seen 45 
from the sequence data ("Al "wx Fig. 4 that the tryptic peptides T1 and T1 3 of LeIF B correspond to 
amino acids 1 45 — 1 49 and 57 — 6 1 respectively of LeIF A. The actual DNA coding sequences found in 
these two regions are those represented by primer pool T1 -C and primer T1 3-C (see Fig. 8) 

H. Direct Expression of Mature Leukocyte Interferon A (LeIF A) 
60 1 . Generally qq 

The procedure followed to express LeIF A directly as a mature Interferon polypeptide is a variant 
of that earlier employed for human growth hormone (Goeddel et al.. Nature 281, 544 — 548 [1979J), 
insofar as it involved the combmation of synthetic (N-terminal) and complementary DNAs- 

As shown in Rgure 6, a Sau3a restriction endonuclease site is conveniently located between 

55 codons 1 and 3 of LeIF A. Two synthetic deoxyoligonucleotides were designed which incorporate an 55 
ATG translational initiation codon, restore the codon for amino acid 1 (cysteine), and create an EcoRI 
sticky end. These oligomers were ligated to a 34 b.p. Sau3a — Avail fragment of pL3 1 . The resulting 45 
b.p. product was ligated to two additional DNA fragments to construct an 865 b.p. synthetic-natural 
hybrid gene which codes for LeIF A and which is bounded by EcoRI and Pstl restriction sites. This gene 

60 was inserted Into pBR322 between the EcoRI and Pstl sites to give the plasmid pLelF A 1 . " $0 
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2. Construction of the Tryptophan Control Element (Containing the E. Coli trp Promoter, 
Operator and trp Leader Ribosome Binding Site but Lacking an ATG Sequence for Initiation of 
Translation). 

Plasmid pGMI carries the E. coli tryptophan operon containing the deletion ALE1413 (Miozzari et 
5 aL, J. Bacteriology 133, 1 457 — 1 466 [1 978]) and hence expresses a fusion protein comprising the 5 
first 6 amino acids of the trp leader and approximately the last third of the trp E polypeptide (hereinafter 
referred to in conjunction as LE'), as well as the trp D polypeptide in its entirety, all under the control of 
the trp promoter-operator system. The plasmid, 20 fig, was digested with the restriction enzyme Pvull 
which cleaves the plasmid at five sites. The gene fragments were next combined with EcoRI linkers 

1 0 (consisting of a self complementary oligonucleotide of the sequence: pCATGAATTCATG) providing an 10 
EcoRI cleavage site for a later cloning Into a plasmid containing an EcoRI site. The 20 /ig of DNA 
fragments obtained from pGM1 were treated with 10 units T 4 DNA figase in the presence of 200 
pmoles of the B'-phosphorylated synthetic oligonucleotide pCATGAATTCATG and in 
20 fi\ T 4 DNA ligase buffer (20 mM Tris f pH 7.6, 0.5 mM ATP, 1 0 mM MgCI 2 ,*5 mM dithiothrehol) 

15 at 4°C overnight The solution was then heated 1 0 minutes at 70°C to halt ligation. The \ 5 

linkers were cleaved by EcoRI digestion and the fragments, now with EcoRI ends were separated 
using 5 percent polyacryl amide gel electrophoresis (hereinafter PAGE) and the three largest 
fragments isolated from the gel by first staining with ethidium bromide, locating the fragments with 
ultraviolet light, and cutting from the gel the portions of interest Each gel fragment with 300 

20 microliters 0. 1 xTBE, was placed in a dialysis bag and subjected to electrophoresis at 1 00 V for one 20 
hour in 0. 1 xTBE buffer (TBE buffer contains: 1 0.8 g Tris base, 5.5 g boric acid, 0.09 g Na 2 EDTA in 1 
liter H 2 0). The aqueous solution was collected from the dialysis bag, phenol extracted, chloroform 
extracted and made 0.2 M sodium chloride, and the DNA recovered in water after ethanol precipitation. 
The trp promoter-operator-containing gene with EcoRI sticky ends was identified in the procedure next 

25 described, which entails the insertion of fragments into a tetracycline sensitive plasmid which, upon 25 
promoter-operator insertion, becomes tetracycline resistant 

Plasmid pBRHI (Rodriguez et a!.. Nucleic Acids Res. 6", 3267 — 3287 [1 979] expresses ampicillin 
resistance and contains the gene for tetracycline resistance but, there being no associated promoter, 
does not express that resistance. The plasmid is accordingly tetracycline sensitive. By introducing a 
30 promoter-operator system in the EcoRI site, the plasmid can be made tetracycline resistant 30 

pBRHI was digested with EcoRI and the enzyme removed by phenol extraction followed by 
chloroform extraction and recovered in water after ethanol precipitation. The resulting DNA molecule 
was, in separate reaction mixtures, combined with each of the three DNA fragments obtained above 
and ligated with T 4 DNA ligase as previously described. The DNA present in the reaction mixture was 
35 used to transform competent E. coli K-1 2 strain 294 by standard techniques (HershfiekJ et al., Proc. 35 
Natl. Acad. Sci. U.S.A. 71 , 3455 — 3459 [1 974] and the bacteria plated on LB (Lura-Bertani) plates 
containing 20 yug/ml ampicillin and 5 /*g/ml tetracycline. Several tetracycline-resistant colonies were 
selected, plasmid DNA isolated and the presence of the desired fragment confirmed by restriction 
enzyme analysis. The resulting plasmid Is designated pBRHtrp. 

40 An EcoRI and BamHI digestion product of the viral genome of hepatitis B was obtained by 40 

conventional meansand cloned Into the EcoRI and BamHI sites of plasmid pGH6 (Goeddel et al. Nature 
281 ', 544 (1 979]) to form the plasmid pHS32. Plasmid pHS32 was cleaved with Xbal, phenol 
extracted, chloroform extracted and ethanol precipitated. It was then treated with 1 p\ E. coli DNA 
polymerase I, Klenow fragment (Beohringer-Mannhelm) in 30 fit polymerase buffer (50 mM potassium 

45 phosphate pH 7.4, 7 mM MgCI 2 , 1 mM /J-mercaptophenol) containing 0.1 mM oTTP and 0.1 mM dCTP 45 
for 30 minutes at 0°C then 2 hours at 37 °C This treatment causes 2 of the 4 nucleotides 
complementary to the 5' protruding end of the Xbal cleavage site to be filled in: 

5' CTAGA — 5' CTAGA— 

3' T— 3' TCT — 

50 Two nucleotides, dC and dT, were incorporated giving and end with two 5' protruding nucleotides. This 50 
linear residue of plasmid pHS32 (after phenol and chloroform extraction and recovery in water after 
ethanol precipitation) was cleaved with EcoRI. The large plasmid fragment was separated from the 
smaller EcoRI — Xbal fragment by PAGE and isolated after electroelution. This DNA fragment from 
pHS32 (0.2 fig), was ligated, under conditions similar to those described above, to the EcoRI — Taql 

55 fragment of the tryptophan operon (~0.01 fig), derived from pBRHtrp. 55 

In the process of ligating the fragment from pHS32 to the EcoRI — Taql fragment as described 
above, the Taql protruding end is ligated to the Xbal remaining protruding end even though it is not 
completely Watson-Crick base-paired: 

— T CTAGA— — TCTAGA— 

+ -» 

60 — AGC TCT— — AGCTCT— 60 



A portion of this ligation reaction mixture was transformed into E coli 294 ceils, heat treated and 
plated on LB plates containing ampicfllin. Twenty-four colonies were selected, grown in 3 ml LB (Luria- 
Bertani) media, and plasmid isolated Six of these were found to have the Xbal site regenerated via E 
coli catalyzed DISIA repair and replication: 

5 — TCTAGA — — TCTAGA — 5 

-+ 

— AGCTCT — — AGATCT— 

These plasmids were also found to cleave both with EcoRI and Hpal and to give the expected restriction 
fragments. One plasmid, designated pTrpI 4, was used for expression of heterologous polypeptidesTas 
next discussed. 

1 0 The plasmid pHGH 1 07 (Goeddel et al., Nature 28 1 , 544, [1 979]) contains a gene for human 1 0 

growth hormone made up of 23 amino acid codons produced from synthetic DNA fragments and 1 63 
amino acid codons obtained from complementary DNA produced via reverse transcription of human 
growth hormone messenger RNA. This gene, though it lacks the codons of the "pre" sequence of 
human growth hormone, does contain an ATG translation initiation codon. The gene was isolated from 

fS^l 07 after treatment ***** EcoRI followed by E. coli DNA polymerase I Klenow fragment 1 5 
and dTTP and dATP as described above. Following phenol and chloroform extraction and ethanol 
precipitation the plasmid was treated with BamHI. 

The human growth hormone (HGH) gene-containing fragment was isolated by PAGE followed by 
electroelution. The resulting DNA fragment also contains the first 350 nucleotides of the tetracyline 

20 resistance structural gene, but lacks the tetracycline promoter-operator system so that, when 20 
subsequently cloned into an expression plasmid, plasmids containing the Insert can be located by the 
restoration of tetracycline resistance. Because the EcoRI end of the fragment has been filled in by the 
Klenow polymerase I procedure, the fragment has one blunt and one sticky end, ensuring proper 
orientation when later inserted into an expression plasmid. 

25 The expression plasmid pTrp 1 4 was next prepared to receive the HGH gene-containing fragment 25 

prepared above. Thus, pTrp14 was Xbal digested and the resulting sticky ends filled in with the Klenow 
polymerase I procedure employing dATP, dTTP, dGTP and dCTP. After phenol and chloroform 
extraction and ethanol precipitation the resulting DNA was treated with BamHI and the resulting large 
plasmid fragment isolated by PAGE and electroelution. The pTrpI 4-derived fragment had one blunt 

30 and one sticky end, permitting recombination in proper orientation with the HGH gene containing 30 
fragment previously described. 

The HGH gene fragment and the pTrp14 AXba-BamHI fragment were combined and ligated 
together under conditions similar to those described above. The filled in Xbal and EcoRI ends ligated 
together by blunt end ligation to recreate both the Xbal and the EcoRI site: 

35 Xbal filled in EcoRI filled in HGH gene initiation 35 

— TCTAG * AATTCTATG— — TCTAGAATTCTATG— 

— AGATC TTAAGATAC— — AGATCTTAAGATAC — 

Xbal EcoRI 

This construction also recreates the tetracycline resistance gene. Since the plasmid pHGH 1 07 

40 expresses tetracycline resistance from a promoter lying upstream from the HGH gene (the lac 40 
promoter), this construction designated pHGH 207, permits expression of the gene for tetracycline 
resistance under the control of the tryptophan promoter operator. Thus the ligation mixture was 
transformed into E. coli 294 and colonies selected on LB plates containing 5 fig/m\ tetracycline. 
Plasmid pHGH 207 was EcoRI digested and a 300 b.p. fragment containing the tip promoter, 

45 operator and trp leader ribosome binding site but lacking an ATG sequence for initiation of translation 45 
was recovered by PAGE followed by electroelution. This DNA fragment was cloned into the EcoRI site 
of pLelF A. Expression plasmids containing the above modified trp regulon {E. coli trp operon from 
which the attenuator sequence has been deleted to controllably heighten expression levels) can be 
grown to predetermined levels in nutrient media containing additive tryptophan in quantities sufficient 

50 to repress the promoter-operator system, then be deprived of tryptophan so as to derepress the system 50 
and occasion the expression of the Intended product 

More particularly, and with reference to Figure 6, 250 fig of plasmid pL31 were digested with 
Pstl and the 1 000 b.p. insert isolated by gel electrophoresis on a 6% polyacrylamide gel. Approximately 
40 ^9 of Insert was electrocuted from the gel and divided into 3 aliquots for further digestion: a) A 1 6 

55 fig sample of this fragment was partially digested with 40 units of Bglll for 45 minutes at 37°C and the 55 
reaction mixture purified on a 6% polyacrylamide gel. Approximately 2 fig of the desired 670 b.p. 
fragment were recovered, b) Another sample (8 fig) of the 1 000 b.p. Pstl insert was restricted with 
Avail and Bglll. One fig of the indicated 1 50 b.p. fragment was recovered after gel electrophoresis, c) 
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1 6 hq of the 1000 b.p. piece was treated with Sau3a and Avail. After electrophoresis on a 1096 
polyacrylamide gel, approximately 0.25 fig ( 1 0 pmole) of the 34 b.p. fragment was recovered. The two 
indicated deoxyoRgonucleotides, 5'-dAATTCATGTGT (fragment 1) and 5'-dGATCACACATG 
(fragment 2) were synthesized by the phosphotriester procedure. Fragment 2 was phosphorylated as 
5 follows. 200 pi (~ 40 pmole) of {y* 2 ?) ATP (Amersham, 5000 Ci/mmole) was dried down and 

resuspended in 30 /u\ of 60 mM Tris-HCI (pH8), 1 0 mM MgCI 2 , 1 5 mM 0-mercaptoethanoI, containing 
100 pmoles of DNA fragment and 2 units of T4 polynucleotide kinase. After 1 5 minutes at 37 °C, 1 p\ 
of 1 0 mM ATP was added and the reaction allowed to proceed another 1 5 minutes. The mixture was 
then heated at 70°C for 1 5 minutes, combined with 1 00 pmole of 5'-0H fragment 1 and 1 0 pmole of 
1 0 the 34 b.p. Sau3a-Avall fragment Ligation was performed for 5 hours at 4°C in 50 fi\ of 20 mM Tris- 1 0 
HCI (pH 7.5) 1 0 mM Mg Cl 2 , 1 0 mM dithiothreitol, 0.5 mM ATP and 1 0 units T4 DNA figase. The mixture 
was electrophoresed on a 6% polyacrylamide gel and the 45 b.p. product recovered by electroelution. 
About 30 ng (1 pmole) of the 45 b.p. product were combined with 0.5 fig (5 pmoles) of die 
1 50 b.p. Avail — Bglll fragment and 1 fig (2 pmoles) of the 670 b.p. Bgfll — Pstl fragment The 

1 5 ligation was performed at 20° C for 1 6 hours using 20 units of T4 DNA ligase. The (igase 1 5 

was inactivated by heating to 65°C for 10 minutes. The mixture was then digested with EcoRl 
and Pstl to eliminate polymers of the gene. The mixture was purified by 6% PAGE. About 20 ng 
(0.04 pmole) of the 865 b.p. product were isolated. One-half (1 0 ng) of this was ligated into 
pBR322 (0.3 fig) between the EcoRl and Pstl sites. Transformation of E. coii 294 gave 70 tetracycline 

20 resistant emptciilin sensitive transformants. Plasmid DNA isolated from 1 8 of these transformants was 20 
digested with EcoRl and Pstl. 1 6 of the 1 8 plasmlds had an EcoRl — Pstl fragment 865 b.p. in length. 
One fig of one of these, pLelF A1 , was digested with EcoRl and ligated to the 300 b.p. EcoRl fragment 
(0.1 fig) containing the E. coll trp promoter and trp leader ribosome binding site, prepared as described 
above. Transformants containing the trp promoter were identified using a M P-trp probe in conjunction 

25 with the Grunstein-Hogness colony screening procedure. An asymetrically located Xbal site In the trp 25 
fragment allowed determination of recombinants in which the trp promoter was oriented in the 
direction of the Lei F A gene. 

I. In Vitro and In Vivo Activity of LeIF A 

Extracts were prepared for IF assay as follows: one ml cultures were grown in L broth containing 

30 5 /ig/m! tetracycline to an Ag 50 value of about 1 .0, then diluted Into 25 ml of M9 media containing 5 30 
jig/ml tetracycline. 10 ml samples were harvested by centrifugation when reached 1.0 and cell 
pellets were suspended in 1 ml of 1 5 percent sucrose, 50 mM Tris-HCI (pH 8.0), 50 mM EDTA. One mg 
of lysozyme was added and, after 5 minutes at 0°C, cells were disrupted by sonication. The samples 
were centrifuged 10 minutes (1 5,000 rpm) and Interferon activity in the supernatants was determined 

35 by comparison with LeIF standards by the cytopathic effect (CPE) inhibition assay. To determine the 35 
number of IF molecules per cell a LeIF specific activity 4x10* unrts/mg was used. 

As shown In Table 1 , Clone pLelF A trp 25, in which the trp promoter was inserted in the desired 
orientation, gives high levels of activity (as high as 2.5 x 1 0 8 units per liter). As shown in Table 2, the IF 
produced by E. coii K-1 2 strain 294/pLelF A trp 25 behaves like authentic human LeIF; it Is stable to 

40 treatment at pH 2 and is neutralized by rabbit anti-human leukocyte antibodies. The interferon has an 40 
apparent molecular weight of approximately 20,000. 

The in vivo efficacy of interferon requires the presence of macrophages and natural killer (NK) 
cells and the in vivo mode of action appears to involve stimulation of these cells. Thus, it remained 
possible that the interferon produced by E coii 294/pLeIF A 25, while having antiviral activity in the 

45 cell culture assay, would not be active in infected animals. Moreover, the in vivo antiviral activity of the 45 
bacterially produced, non-glycosylated LeIF A might be different from the glycosylated LeIF derived from 
human "buffy coat" leukocytes. Therefore the biological activity of bacterially synthesized LeIF A (2% 
pure) was compared with buffy coat LeIF (8% pure) in lethal encephalomyocarditis (EMC) virus 
infection of squirrel monkeys (Table 3). 

50 Table 1 50 



Interferon Activity in Extracts of E. Coll 



E.coliK-12 






LeIF mole- 


strain 294 


Cell density 


IF Activity 


cules per 


transformed by 


(cells/ml) 


units/mi culture 


cell 


pLelF A trp 25 


3.5x10 8 


36,000 


9.000 


pLelF A trp 25 


1.8x10 s 


250,000 


12,000 



* 
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Table 2 

Comparison of Activities of Extracts from E. Coli 
294/pLelF A 25 with Standard LeIF* 



294/pLelFAtrp25 

extract 
LeIF standard 



Interferon Activity (units/ml) 


untreated 


pH2 


rabbit anti-human 
leukocyte antibodies 


500 




500 


<10 


500 




500 


<10 



10 



15 



•The 250,000 units/ml extract of E. coli 294/pLelF A trp 25 described in Table 1 was diluted 500- 
fold with minimal essential medium giving a specific activity of 500 units/ml. A leukocyte interferon 
standard (Wadley Institute) previously titrated against the NIH leukocyte interferon standard, was also 
diluted to a final concentration of 500 U/ml. One ml aliquots were adjusted to pH 2 with 1 N HCI, 
incubated at 4°C for 52 hours, neutralized by addition of NaOH and IF activity determined by the 
standard CPE inhibition assay. 25 /*! aliquots of the 500 units/ml samples (untreated) were incubated 
wrth 25 fx\ of rabbit anti-human leukocyte interferon for 60 minutes at 37°C f centrifuged at 1 2,000xg 
for 5 minutes and the supernatant assayed. 

Table 3 

Antiviral Effect of Various LeIF Preparations 
Against EMC virus infection of Squirrel Monkeys 







Serum 
plague forming units/ml 


Treatment 


Survivors 


day 2 


day 3 


day 4 


Control 
(bacterial proteins) 

Bacterial LeIF A 
LeIF standard 


0/3 

• 

3/3 
3/3 


0 >3 

o) 

0 
0 
0 
0 
0 
0 


3x10*} 

o Mo 4 

0 J 

0 

0 

0 

0 

0 

0 


10 s "| 

1,200 y>3.4x10« 
0 J 
0 
0 
0 
0 
0 
0 



•25 



All monkeys were male (average weight 713 g) and had no EMC virus antibodies prior to 

infection. The monkeys were infected intramuscularly with 1 00xLD M EMC virus (determined in mice). 

The control treated monkeys died at 1 34, 1 58 and 1 64 hours post infection. Interferon treatments with 

20 1 0 6 units were by the intravenous route at -4, +2, 23, 29, 48, 72, 1 68 and 240 hours, relative to 

infection. The bacterial leukocyte interferon was a column chromatography fraction from a lysate of E. 

coli 294/pLelF A 25 at a specific activity of 7.4x 1 0 fl units/mg protein. The control bacterial proteins 

were an equivalent column fraction from a lysate of E coli 294/pBR322 at twice the total protein 

concentration. The leukocyte interferon standard was Sen da i virus induced interferon from normal 

human "buffy-coat" cells, purified chromatographically to a specific activity of 32 x 1 0 s units/mg 
protein. 

The control monkeys showed progressive lethargy, loss of balance, flaccid paralysis of the hind- 
limbs and watering of the eyes commencing around 8 hours prior to death. The Interferon treated 
monkeys showed none of these abnormalities; they remained active at all times and developed no 
viremia. The one monkey in the control group which did not develop viremia by 4 days died latest (1 64 
hours post infection) but showed high titers of virus in the heart and brain on post mortem. The 
interferon treated monkeys did not develop antibodies to EMC virus as determined 1 4 and 21 days 
after infection. These results demonstrate that the antiviral effects of LeIF preparations in the Infected 
animals can be attributed solely to interferon because the contaminating proteins are quite different in 
the bacterial and buffy coat preparations. In addition these results indicate that glycosylation is not 
required for the in vivo antiviral activity of LeIF A. 

J. Isolation of cONAs for Additional Leukocyte Interferons 

DNA from the fully characterized LeIF a cDNA-containlng plasmid was excised with Pst I isolated 
electrotrophoretically, and labelled with »p. The resulting radioactively labelled DNA was used as a 
probe to screen additional E. coli 294 transformants, obtained identically as those in Part C by the in 
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situ colony screening procedure of Grunstein and Hogness (see above). Colonies were isolated which 
hybridized in varying amounts to the probe. Plasmid DMA from these colonies and the ten hybridizing 
colonies referred to in Part G above was isolated by Pst I cutting and characterized by three different 
methods. First, these Pst fragments were characterized by their restriction endonudease digestion 
5 patterns with the enzymes Bgl II, Pvu II, and Eco Rl. This analysis allowed the classification of at least 5 
eight different types (LeIF A, LeIF B, LelF C, LeIF D, LeIF E. LeIF F, LeIF G and LeIF H), shown in Figure 5, 
which approximates the location of various restriction cuts relative to the by-now known presequence 
and coding sequence of LeIF A. One of these, LeIF D, is believed to be identical to that reported by 
Nagata et al., Nature 284, 3 1 6—320 { 1 980). 

1 0 Secondly, certain of the DNAs were tested by the hybridization selection assay described by j q 

Cleveland et al., Celt 20, 95 — 1 05 (1 980) for the ability to selectively remove LeIF mRNA from pory-A 
containing KG-1 cell RNA. LeIF A, B, C and F were positive by this assay. Third, the latter Pst fragments 
were inserted in an expression plasmid. E coli 294 was transformed with the plasmid, and the 
fragments were expressed. The expression products, believed to have pre-interferons, were all positive 

15 by CPE assay for interferon activity, albeit marginally active In the case of the LeIF F fragment In 15 
addition to the foregoing, all of the LeIF types described have been sequenced. 

K. Direct Expression of a Second Mature Leukocyte Interferon (LeIF B) 

The sequence of the Isolated fragment comprising the gene for mature LeIF B shows the first 
fourteen nucleotides of types A and B to be identical. Accordingly a fragment from pLelF A25 bearing 
20 the trp-promoter-operator, ribosome binding site and the start of the LeIF A (=B) gene was isolated and 20 
combined with the remaining portion of the B sequence in an expression plasmid. The salient 
restriction maps for the Pst fragment of pL4 (a plasmid comprising the LeIF B Pst-ended gene depicted 
in Figure 5) and pLelF A25 are shown, respectively, in Figures 7a and 7b. 

To obtain the approximately 950 b.p. Sau3a to Pst] fragment from the sequence shown in Figure 
25 7a several steps were necessary because of the presence of one or more Intervening Sau3a restriction 25 
sites, i.e.: 

1 . The following fragments were isolated: 

a) 1 1 0b b.p. from Sau3a to EcoRI; 

b) 132 b.p. from EcoRI to Xba; 

30 c) >700 b.p. from Xba to Pst. 30 

2. Fragments (1 a) and (1b) were ligated and cut with Xba and Bglll to preclude self- 
polymerization through Sau 3a and Xba end terminals (the relevant Sau3a site was within a Bglll site; 
Bgl 1 1 cuts to leave a Sau3a sticky end). A 242 b.p. fragment was isolated. 

3. The product of (2) and (1c) were ligated and cut with Pstl and Bglll, again to prevent self- 

35 polymerization. An approximate 950 b.p. fragment, Sau 3a to Pstl of Figure 7a, was isolated. This 35 
fragment comprised that portion of the LeIF B gene not common to LeIF A. 

4. An approximate 300 b.p. fragment (HIndtll to Sau3a) comprising the up promoter-operator, 
ribosome binding site, ATG start signal and cysteine codon of LeIF A was isolated from pLelF A25. 

5. An approximately 3600 b.p. fragment Pstl to Hindlll was isolated from pBR322. This 

40 comprised the replicon and encoded tetracycline but not ampiciUin resistance. 40 

6. The fragments obtained in steps 3, 4 and 5 were triple-ligated and the resulting plasmid 
transformed into E coli K-1 2 strain 294. 

Transformants were miniscreened (Birnbolm et al., Nucleic Acids Res. 7, 1 5 1 3 — 1 523 [1 979]) 
and plasmid samples were digested with EcoRI. Digests yielded three fragments characteristic of: 1 ) 
45 The EcoRI-EcoRI trp promoter fragment; 2) The Internal EcoRI to EcoRI fragment of pL4; and 3) protein 45 
translational start signal to EcoRI fragment of pL4. 

In CPE assay, bacterial extracts from clones made in the foregoing fashion typically assay at 
about 1 0x 1 0 s units interferon activity per liter at A sso =1 . One representative clone prepared in this 
manner is E. coli 294/pLelF B trp 7. 

50 L. Direct Expression of Further Mature Leukocyte Interferons (LeIF, C, D, F, G, H, I and J) 50 

Additional full-length gene fragments that comprise other LeIF types may be tailored and placed 
in expression vehicles for expression as in the case of LeIF A. Complete sequencing by conventional 
means will reveal whether a restriction site lies sufficiently near the first amino acid codon of the 
mature interferon type as to permit convenient resort to the approach employed in part H, supra, for the 

55 expression of mature LeIF A, i.e., elimination of the presequence by restriction cutting and replacement 55 
of codons for the N-termfnal amino acids lost in presequence elimination by ligation of a synthetic DNA 
fragment Failing that, the following procedure may be employed. Briefly, this entails cleaving the 
presequence-containing fragment precisely before the point at which the codon for the first amino acid 
of the mature polypeptide begins, by: 

60 1 • converting the double stranded DNA to single-stranded DNA in a region surrounding that point; 60 

2. hybridizing to the single-stranded region formed in step (a) a complementary primer length of 
single-stranded DNA, the 5' end of the primer lying opposite the nucleotide adjoining the intended 
cleavage site; 
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3. restoring that portion of the second strand eliminated in step 1 which lies in the 3' direction 
from the primer by reaction with DNA polymerase in the presence of adenine, thymine, guanine and 
cytosine-containing deoxynucieotide triphosphates; and 

4. digesting the remaining single-stranded length of DNA which protrudes beyond the intended 

5 cleavage point 5 

A short length of synthetic DNA terminating, at the 3' end of the coding strand, with the ; 
translation start signal ATG can then be ligated by, e.g., blunt-end ligation to the resulting tailored gene 
for the mature interferons and the gene inserted into an expression plasmld and brought under the 
controi of a promoter and its associated ribosome binding site. « 

10 In a manner similar to that employed in part K. supra, gene fragments encoding LeIF C and LeIF D 10 

were appropriately configured for direct bacterial expression. The expression strategy for these 
additional leukocyte interferons included, in each case, resort to the approximately 300 b.p. fragment 
(Hindlll to Sau3a) comprising the trp promoter-operator, ribosome binding site, ATG start signal and 
cysteine codon of LeIF A from pLelF A25. To this were combined gene fragments from the additional 

1 5 interferon genes encoding their respective amino acid sequences beyond the initial cysteine common 1 5 
to all. Each resulting plasmid was used to transform E. coli K-1 2 strain 294. Ligations to form the 
respective genes were as follows: 

LeIFC 

Isolate the following fragments from pLelF C: 
20 (a) 35 b.p. from Sau 3a to Sau 96 20 

(b) >900 b.p. Sau 96 to Pst I 

(c) Isolate an approximate 300 b.p. fragment (Hind III — Sau 3a) from pLelF A-25 as in part K (4) 

supra. 

(d) Isolate the approximately 3600 b.p. fragment of part K (5) supra. 



25 



25 Construction: 

(1) Ugate (a) and (c). Cleave with BgJil, Hindlll and isolate the approximately 335 b.p. product 

(2) Triple ligate {1 )+{b)+(d) and transform E. coli with the resulting plasmid. 
A representative clone made in this manner is E coli K-1 2 strain 294/pLelF C trp 35. 

LeiF D 

30 Isolate from pLelF D. 30 

a) 35 b.p. from Sau3a to Avail 

b) 1 50 b.p. from Avail to Bglll 

c) approx. 700 b.p. from Bglll to Psti 

Isolate from pLelF A25: 

35 d) 300 b.p. from Hindlll to Sau3a 35 

Isolate from pBR322: 

e) approx. 3600 b.p. from Hindlll to Pst I 

Construction: 

(1 ) ligate (a)+(b), cut with Bglll and purify a b.p. product (1 ). 
40 (2) ligate (1)+(d), cut with Hindlll, Bgttll, and purify the approx. 500 b.p. product (2). 40 

(3) ligate (2)+{c)+(e) and transform E. coli with the resulting plasmid. 

A representative clone made in this manner is E. coli K-1 2 strain 294/pLelFD trp 1 1 . 

LeIFF 

The LeIF F containing fragment may be tailored for direct expression through reassembly made 
45 convenient by the complete homology of amino acids 1 — 1 3 of LeIF B and LeIF F. A trp promoter- 45 
containing fragment (a) with appropriately configured ends is obtained from pHGH207, described 
above, via Pst I and Xba I digestion followed by isolation of the ca. 1 050 b.p. fragment A second 
fragment (b) is obtained as the larger of the fragments resulting from Pst I and Bglll digestion of the 
plasmid pHKY 10. Fragment (a) contains approximately half the gene encoding ampiciliin resistance; 
50 fragment (b) contains the remainder of that gene and the entire gene for tetracycline resistance save 50 
for the associated promoter. Fragments (a) and (b) are combined via T4 ligase and the product treated 
with Xba I and Bglll to eliminate dimerization, forming a fragment (c) comprising the trp promoter- 
operator and genes for tetracycline and ampiciilin resistance. 

A fragment (d) of approximately 580 b.p. is obtained by Avail and Bglll digestion of pLelF F. This * 
55 comprises codons for amino acids 1 4 — 1 66 of LeIF F. 55 
A fragment (e) (49 b.p.) is obtained by Xba I and Avail digestion of pLelF B. Fragment (e) encodes 
amino acids 1 — 1 3 of LeIF F. 

Fragments (c), (d) and (e) are triple ligated in the presence of T4 ligase. The cohesive ends of the 
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respective fragments are such that the composite piasmid circularizes correctly, bringing the tetracycline 
resistance gene under the control of the trp promoter-operator along with the gene for mature LeIF F, 
such that bacteria transformed with the desired piasmid may be selected on tetracycline-containing 
plates. A representative done prepared in this manner is E. coli K-1 2 strain 294/pLelF F trp 1 . 

5 LeiF H 5 
The complete LeIF H gene may be configured for expression as a mature leukocyte interferon as 
follows: 

1 . Piasmid pLelF H is subjected to Haell and Rsal digestion with isolation of the 81 6 b.p. 
fragment extending from the signal peptide amino add 10 to the 3' noncoding region. 
tO 2. The fragment is denatured and subjected to repair synthesis with Klenow fragment of DNA 1 0 

polymerase I (Klenow et al., Proc. Natl. Acad. Sci. U.S.A. 65, 1 68 [1 970]), employing the synthetic 
deoxyoligonudeotide dAATTCTGCAG (an EcoRI — PstI convenor) was ligated to the 2000 b.p. EcoRI 

3. The resulting product Is cleaved with Sau3a and a 452 b.p. fragment representing amino acids 
1 to 1 50 isolated. 

15 4. Sau3a and PstI digestion of pLelF H and Isolation of the resulting 500 b.p. fragment yields a 15 

gene encoding amino acids 1 50 through the end of the coding sequence. 

5. Fragments Isolated in steps (3) and (4) are ligated to form a fragment: 

1 166 
met cys , asp stop 

ATGTGT. . 1 . . . GATTGA — . . . — Pst I 

Sau3a 

encoding the 1 66 amino acids of LeIF H. 
20 6. pLelF A trp 25 is digested with Xbal, blunt ended with DNA polymerase I and the product 20 

digested with Pst I. The large resulting fragment may be isolated and ligated with the product of step 
(5) to form an expression piasmid capable, upon transformation of E. coll K-1 2 strain 294 or other host 
bacteria, of expressing mature LeIF H. 



LeiF I 

25 The phase X Charon 4A recombinant library of the human genome constructed by Lawn et al., 25 
Cell 15, 1 1 57 (1 978), was screened for leukocyte interferon genes by procedures 'described by Lawn 
et al. (supra) and Maniatis et al.. Cell 15, 687 (1 978). A radioactive LeIF probe derived from the cDNA 
clone LeIF A, was used to screen approximately 500,000 plaques. Six LeIF genome clones were 
obtained in this screening. Following rescreening and plaque purification one of these clones, AHLelF2, 

30 was selected for further analysis. 30 
Using the method described above, other probes can be used to advantage to isolate additional 
LeIF clones from the human genome. These, in turn, can be employed to produce additions! leukocyte 
interferon proteins in accordance with this invention. 

1 . The 2000 b.p. EcoRI fragment of clone AHLelF2 was subcloned into pBR325 at the EcoRI site. 

35 The resulting piasmid LeIF I was cleaved with EcoRI and the 2000 b.p. fragment isolated. The 35 
deoxyoligonudeotide dAATTCTGCAG (an EcoRI — Psti convenor) was ligated to the 2000 b.p. EcoRI 
fragment and the resulting product deaved with Pst) to give a 2000 b.p. fragment containing PstI ends. 
This was cleaved with Sau96 and a 1 100 b.p. fragment isolated which has one PstI and one Sau96 
end. 

40 2. The piasmid pLelF C trp 35 was digested with PstI and Xbal. The large fragment was isolated. 40 

3. The small Xbal — PstI fragment from pLelF C trp 35 was digested with Xbal and Sau96. A 40 
b.p. Xbal — Sau96 fragment was isolated. 

4. The fragments isolated In steps (1 ), (2) and (3) were ligated to form the expression piasmid 
pLelFltrpl. 

45 LeIF J 45 

1 . The piasmid pLelF J contains a 3.8 kilobase Hindltl fragment of human genomic DNA which 
includes the LeIF J gene sequence. A 760 b.p. Odel — Rsal fragment was isolated from this piasmid. 

2. The piasmid pLelF B trp 7 was cleaved with Hindlll and Ddel and a 340 b.p. Hindlll — Ddel 
fragment isolated. 

50 3. The piasmid pBR322 was cleaved with PstI, blunt ended by incubation with DNA polymerase I 50 

(Klenow fragment), then digested with Hindlll. The large (~3600 b.p.) fragment was isolated. 

4. Fragments Isolated in steps (1 ), (2) and (3) were ligated to form the expression piasmid pLelF J 

trp 1. 

M. Purification 

55 The content of leukocyte interferon in bacterial extracts may be enhanced by successive: 55 

1 . PolyethyJenenmine precipitation, in which most of the cellular protein, including the interferon, 
remains in the supernatant. 
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2. Ammonium sulfate fractionation, in which interferon comes out of soiution of 55% saturated 
ammonium sulfate. 

3. Suspension of the ammonium sulfate pellet in 0.06 M potassium phosphate, 10 mM Tris-HCI, 
pH 7.2, and dialysis against 25 mM Tris-HCI, pH 7.9 (interferon activity remains in solution). 

5 4. Chromatography of the above supernatant pH adjusted to 8.5, on a DEAE-cellulose column 5 

(eluting with a linear gradient of 0 to 0-2 M NaCI in 25 mM Tris-HCI, pH 8.5). * 

5. Adsorption on Ctbachrome Biue-Agarose or hydroxylapatite and elution with 1 .5 M KCI or 0.2 

M phosphate solution respectively (optional). « 

6. Molecular sizing on a Sephadexo G-75 column. 

10 7. Cation exchange chromatography on CM-celluIose in 25 mM ammonium acetate at pH 5.0. 1 0 

developed with an ammonium acetate gradient (to 0.2 M ammonium acetate). 
The above process yields material of >95% purity. 

The material can also be purified by size exclusion chromatography, reverse phase (RP-8) high 
pressure liquid chromatography or affinity chromatography on immobilized antiinterferon antibodies. 
1 5 Alternatively the material from step 4 above can be loaded on a monoclonal antibody column, 1 5 

prepared as described by Milstein, Scientific American 243, 66 ( 1 980), and eluted with 0.2 M acetic 
acid, 0.1 % Triton and 0.1 5 M NaCI. 

In an alternative, preferred embodiment the leukocyte interferon produced by the procedure 
described hereinbefore can be purified by the following steps: 
20 1 . Frozen cell pellets containing the expressed leukocyte interferon are broken up manually or by 20 

appropriate size reduction equipment. The partially thawed cells are suspended in 4 volumes of buffer 
A. containing 0.1 M Tris adjusted to pH 7.5 — 8.0, 1096 (w/v) sucrose, 0.2 M NaCI, 5 mM EDTA, 0.1 
mM PMSF and 10 — 1 00 mM MgCl 2 . The suspension is held to approximately 4°C. 

The suspension is passed through a homogenize r at about 6000 psi followed by a second pass at 
25 less than 1 000 psi. Effluent from the homogenizer from both passes is cooled in an ice bath. 25 

2. Polyethylene-imine is added slowly to the homogenate to a concentration of about 0.35% and 
allowed to stand for about 30 minutes. The solids are removed by centrifugation or filtration. This step 
is temperature controlled or performed sufficiently quickly that the supernatant (filtrate) is kept at less 
than 1 0°C. The supernatant (filtrate) is concentrated by ultrafiltration to approximately 1/1 0 the 

30 original volume. Particulate matter or haziness in the retentate may be removed by an appropriate filter 30 
such as a mtcroporous membrane. 

3. The clarified solution is loaded directly onto a monoclonal antibody column at a flux of 5 — 8 
cm/hr. (e.g. 25 — 40 ml/hr. on 2.6 cm diameter column). After loading the column is washed with 
approximately 1 0 column volumes of 25 mM Tris-HCI, pH 7.5 — 8.5, including NaCI (0.5 M) and 

35 surfactant such as Triton X-1 00 (0.2%) or equivalent. Following the wash the column is rinsed with 35 
about 10 column volumes of solution containing 0.1 5 M NaCI and surfactant such as Triton X-1 00 
(0.1%) or equivalent. The column Is eluted with 0.2 M acetic acid containing surfactant such as Triton 
X-1 00 (0.1%) or equivalent. The protein peak from the monoclonal antibody column (as determined by 
UV aborbence or other convenient assay) is pooled and the pH adjusted to approximately 4.5 with 1 N 

40 NaOH or 1.0 M Tris base, 4q 

4. The pooled interferon peak is loaded onto a cationic exchanger such as Whatman CM52 
cellulose or equivalent which has been equilibrated with a suitable buffer such as ammonium acetate 
pH 4.5 (50 mM). After loading, the column is washed with equilibrating buffer until the UV absorbence 
of the effluent has reached a plateau so that little additional protein is eluting from the column. The 

45 column is then eluted with 25 mM ammonium acetate/0.1 2 M sodium chloride or a combination 45 
which optimizes recovery of interferon and affords a fyophilized cake having satisfactory appearance 
and solubility properties. 

The monoclonal antibodies employed In the preferred embodiment described above can be 
prepared by the procedure described by Staehelin et a!., Proc. Natl. Acad. Sci. U.S.A. 78, 1 848 — 52 

50 (1981). Monoclonal antibodies are purified and covalently linked to Afflgel-1 0 as described below: 50 

Preparation and Purification of Monoclonal Antibodies from Ascitic Fluid. 

Five female Balb/c mice were each inoculated with 5 to 1 0x 1 0 6 hybridoma cells from mid-log 
growth phase. About 5x10* viable cells obtained from the mouse producing fluid were inoculated 
intraperitoneally into each of 1 0 or more mice. The ascitic fluid was collected repeatedly (2 to 4 times) 
55 from each mouse. Up to three transfers and collections may be performed from one group of mice to 55 
the next Ascitic fluid from mice at each transfer was pooled. 

Cells and debris were removed from the ascitic fluid by low speed centrifugation (500—1 OOOxg) * 
for 1 5 minutes. Then centrifugation was performed for 90 minutes at 1 8,000 rpm. The supernatant 
was frozen and stored at — 20°C. After thawing, additional fibrin and particulate material were 
60 removed by centrifugation at 3 5,000 rpm for 90 minutes. Batches of ascitic fluid from each transfer 60 
were tested for specific antibody activity by a solid phase antibody binding assay (Staehelin et al., 
supra) and pooled if found satisfactory. 

Concentration of protein in the pooled solutions was estimated by the approximation that 1 mg of 
protein yields and absorbance of 1 .2 at 280 nm in a cuvette with a path length of 1 .0 cm. Ascites fluids 
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with high levels of antibody contain 30 to 35 mg protein/ml. This is equivalent to 4 — 7 mg of specific 
antibody/ml. The fluid was diluted with PBS (0.01 M sodium phosphate. pH 7.3, 0.1 5 M NaCI) to a 
protein concentration of 1 0 to 1 2 mg/ml. 

To each 100 ml of diluted solution, 90 ml of room temperature saturated ammonium sulfate 
5 solution was added slowly with vigorous stirring at 0°C. The suspension was kept in ice for 40 to 60 5 
minutes, then centrifuged for 1 5 minutes at 1 0,000 rpm at 4°C. The supernatant was decanted and 
drained well. The protein pellets were dissolved in 0.02 M Tris.HCI (pH 7.9)/0.04 M NaCI (Buffer I). The 
protein solution was dialyzed for 1 6 to 1 8 hours at room temperature against 1 00 volumes of Buffer I 
with at least one change of the buffer. The dialyzed solution was centrifuged at 1 5,000 rpm for 1 0 

10 minutes to remove undissolved material. About 30 — 35% of the original amount of total protein in the 10 
ascitic fluid was recovered as estimated by absorption at 280 nm. 

The solution containing 30 — 40 mg of protein per ml was then applied to a column of DEAE- 
cellulose equilibrated with Buffer I. A column bed volume of at least 1 00 ml was used for each gram of 
protein applied. The antibody was eluted from the column wfth a linear NaCI gradient containing 0.02 

1 5 M Tris . HQ, pH 7.9, from 0.04 M to 0.5 M NaCI. Pooled peak fractions eluting between 0.06 and 0.1 1 5 
M NaCI were concentrated by precipitation with an equal volume of room temperature saturated 
ammonium sulfate and centrifugation. The protein pellets were dissolved in 0.2 M NaHC0 3 
(pH~8.0)/0.3 M NaCI (Buffer II) followed by dialysis against three changes of the same buffer at room 
temperature. The dialyzed solutions were centrifuged at 2 0,000 xg for 1 5 minutes to remove any 

20 insoluble material. Protein concentration was adjusted to 20 to 25 mg/ml with Buffer IL 20 

Preparation of Immunoadsorbarrts. 

Affigel-1 0 {BioRad Laboratories, Richmond, California) was washed on a sintered glass filter three 
times with ice-cold isopropanol followed by three washed with ice-cold distilled water. The gel slurry 
(—50% in cold water) was transferred to plastic tubes and sedimented by a brief centrifugation. The 
25 supernatant was aspirated. The packed gel was mixed with an equal volume of purified antibody 25 
solution and rotated end-over-end at 4°C for 5 hours. After reaction, the gel was centrifuged, then 
washed twice with Buffer III (0. 1 M NaHCCX/0. 1 5 M NaCI) to remove uncoupled antibody. Protein 
determination of the combined washes revealed that more than 90% of antibody was coupled to the 
gel. 

30 To block unreacted sites, the gel was mixed with an equal volume of 0.1 M ethanolamine. HCI 30 

(pH 8) and rotated end-over-end at room temperature for 60 minutes. The gel slurry was washed free 
of reactants with PBS and stored in PBS in the presence of 0.02% (w/v) sodium azide at 4°C. 



N. Parenteral Administration 

LeIF may be parenterally administered to subjects requiring antitumor or antiviral treatment, and 
35 to those exhibiting immunosuppressive conditions. Dosage and dose rate may parallel that currently in 35 
use in clinical investigations of human derived materials, e.g., (about 1 — T0)x 1 0 8 units daily, and in the 
. case of materials of purity greater than 1 %, likely up to, e.g. 5x 1 0 7 units daily. 

As one example of an appropriate dosage form for essentially homogeneous bacterial LeIF in 
parenteral form, 3 mg LeIF of specific activity of, say, 2 x 1 0 8 unrts/mg may be dissolved in 25 ml of 5 N 
40 human serum albumin, the solution is passed through a bacteriological filter and the ffitered solution 40 
aseptically subdivided into 1 00 vials, each containing 6x 1 0 6 units pure interferon suitable for 
parenteral administration. The vials are preferably stored in the cold (— 20° C) prior to use. 

The compounds of the present invention can be formulated according to known methods to 
prepare pharmaceutically useful compositions, whereby the polypeptide hereof is combined in 
45 admixture with a pharmaceutically acceptable carrier vehicle. Suitable vehicles and their formulation 45 
are described in Remington's Pharmaceutical Sciences by E. W. Martin, which is hereby incorporated 
by reference. Such compositions will contain an effective amount of the interferon protein hereof 
together with a suitable amount of vehicle in order to prepare pharmaceutically acceptable 
compositions suitable for effective admistration to the host One preferred mode of administration is 
50 parenteral. gQ 

Claims 

1 . A polypeptide comprising the amino acid sequence of a mature human leukocyte interferon, 
unaccompanied by any presequence. 

2. A polypeptide according to claim 1 , unaccompanied by associated glycosylation. 

55 3. A polypeptide comprising the amino acid sequence of a mature human leukocyte interferon, 55 

containing the amino acid methionine attached to the N-terminus of the ordinarily first amino acid of 
said interferon. 

4. A polypeptide comprising the amino acid sequence of a mature human leukocyte interferon, 
containing a deavable conjugate or microbial signal protein attached to the N-terminus of the 

60 ordinarily first amino acid of said interferon. qq 

5. A polypeptide according to any one of claims 1 to 4 wherein the amino acid sequence of said 
interferon includes tne partial sequence Cys-Ala-Trp-Glu-Val-Val-Arg-AIa-Glu-lle-Met-Arg-Ser. 
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6. A polypeptide according to any one of claims 1 to 4 wherein the sequence of said interferon 
includes the amino acids specified and positioned as indicated in the sequence "Ail" of Figure 4. 

7. A polypeptide according to any one of claims 1 to 6 selected from the group consisting of 
human leukocyte interferon (LeIF) A. B, C, D, F, H, I, and J. 

8. Human leukocyte interferon A (LeIF A). 

9. Human leukocyte interferon B (LeIF B). 

1 0. Human leukocyte interferon C (LeIF C). 

1 1 . Human leukocyte interferon D (LeIF D). 

12. Human leukocyte interferon F (LeIF F). 

13. Human leukocyte interferon H (LeIF H). 

14. Human leukocyte interferon I (LeIF I). 

1 5. Human leukocyte Interferon J (LeIF J). 

1 6. A polypeptide of human leukocyte Interferon according to any one of claims 1 to 1 5 
microbiaily produced. 

1 7. A polypeptide of human leukocyte interferon according to any one of claims 1 to 1 5 
bacterially produced. 

1 8. A polypeptide of human leukocyte interferon acccording to any one of claims 1 to 1 5 
produced by E. coll.. 

1 9. A DNA sequence comprising the sequence coding for the amino acid sequence of a mature 
human leukocyte interferone unaccompanied by any p resequence. 

20. A DNA sequence comprising the sequence coding for the amino acid sequence of a mature 
human leukocyte interferon containing the amino acid methionine attached to the N-terminus of the 
ordinarily first amino acid of said interferon. 

21. A DNA sequence comprising the sequence coding for the amino acid sequence of a mature 
human leukocyte interferon containing a deavable conjugate or microbial signal protein attached to 
the N-terrninus of the ordinarily first amino add of said Interferon. 

22. A DNA sequence as daimed in any one of claim 1 9 to 21 containing a sequence coding for 
the partial amino add sequence Cys-Ala-Trp^lu-Val-Val-Aj^Ala^lu^le-Met-Arg-Ser. 

23. A DNA sequence as claimed in any one of claims 1 9 to 21 containing codons coding for the 
amino acids of an interferon spedfied and positioned as indicated in the sequence "All" of Figure 4. 

24. A DNA sequence as daimed in any one of claims 1 9 to 23 cooing for a polypeptide selected 
from the group consisting of human leukocyte interferon (LeIF) A, B, C, D, F, H, I and J. 

25. A DNA sequence coding for human leukocyte interferon A (LeIF A). 

26. A DNA sequence coding for human leukocyte Interferon B (LeIF B). 

27. A DNA sequence coding for human leukocyte interferon C (LeIF C). 

28. A DNA sequence coding for human leukocyte interferon D (LeIF D). 

29. A DNA sequence coding for human leukocyte interferon F (LeIF F). 

30. A DNA sequence coding for human leukocyte interferon H (LeIF H). 

31 . A DNA sequence coding for human leukocyte interferon I (LeIF I). 

32. A DNA sequence coding for human leukocyte interferon J (LeIF J). 

33. A DNA sequence selected from the group consisting of sequences A, B, C, D, F, and H of 
Figure 3. 

34. A DNA sequence selected from the group consisting of sequences A, C, H, I, and J of Figure 8. 

35. A DNA sequence according to any one of claims 1 9 to 34 operably linked with a DNA 
sequence capable of effecting microbial expression of a polypeptide according to any one of claims 1 to 
15. 

36. A DNA sequence according to any one of daims 1 9 to 34 operably linked with a DNA 
sequence capable of effecting expression of a polypeptide according to any one of claims 1 to 1 5 in a 
bacterium. 

37. A DNA sequence according to any one of claims 1 9 to 34 operably linked with a DNA 
sequence capable of effecting expression of a polypeptide according to any one of claims 1 to 1 5 in E. 
coli. 

38. A replicable microbial expression vehlde capable, in a transformant microorganism, of 
expressing a polypeptide according to any one of daims 1 to 1 5. 

39. A replicable bacterial expression vehicle capable, in a transformant bacterium, of expressing a 
polypeptide according to any one of claims 1 to 1 5. 

40. A replicable E. coli expression vehide capable, in a transformant E. coli, of expressing a 
polypeptide according to any one of claims 1 to 1 5. 

41 . A microbial expression vehicle according to any one of claims 38 4 0 which is a plasmid. 

42. A plasmid selected from the group consisting of pLel FA trp 25, pLelF B tip 7, and pLelF F tip 
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43. A plasmid selected from the group consisting of PLelF C trp 35 and pLelF D trp 1 1 . 

44. A plasmid selected from the group consisting of pLelF G and pLelF H. 

45. A plasmid selected from the group consisting of pLelF I trp 1 and pLelF J trp 1. 

46. A microorganism transformed with a vehide as daimed in any one of claims 38 to 45. 
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1 k ll ' a Ph .l rm3C f U ^ ca ' ^P 05 ™'™ as claimed in claim 52 for parenteral administration 

dalmsltols "* prcrticBon of a compound claimed in «,y of 25 

62. An expression vehicle as claimed in anv one of rJ»imc *-*o >i c k 30 
microbial production of a compound ^S^Sf^Xln^^ 

claimed r^T^ t0 49 °< a °?««? <* cells as 

35 claims 1 to 1 5 wnenever used in the production of a compound claimed in any one of 

45 m*™^^ "pable. in a transformant 

67. The products and processes for their preparation as hereinbef ore described. 

Pttafd (or H^Maj^. Station^ Ota by ft. Cowta Pnm. Lwntagm Sp.. 1 882. PubS**d by ft, P« M OnTc 
25 Southampton Buftfcgs. London. VWC2A 1AY. from which ccptw may b« obtained. 



